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Preface 


The aim of this book is to deal with all of the elementary methods for obtaining 
explicit solutions of ordinary differential equations, and then to introduce the ideas 
of qualitative analysis using phase plane techniques. Simple difference equations 
are also included, since their methods of solution are similar to those for linear 
differential equations. As well as being, I hope, an internally consistent choice of 
material, this selection of topics also has the advantage of preparing a student for 
a basic course on dynamical systems. 

The book arose from my unsuccessful efforts to find a suitable text to recom- 
mend when I taught the first year Warwick differential equations course. Although 
there are a number of well-established and successful textbooks that treat this sub- 
ject (these are discussed, along with other possibilities for further reading, in the 
final chapter), they seem either to include a large amount of additional material, or 
to concentrate only on the more advanced topics. I therefore produced a detailed set 
of lecture notes, which, with the encouragement of Alan Harvey and David Tranah, 
and most significantly Kenneth Blake at Cambridge University Press, eventually 
became this book. My thanks here to all those students who made useful sugges- 
tions while this book was still at the lecture note stage. 

Part I contains an informal discussion of the issues of existence and uniqueness 
of solutions, and treats the standard classes of first order differential equations 
that can be solved explicitly, as well as covering exact equations and substitution 
methods. 

The first chapter of Part II shows that two linearly independent solutions are 
needed in order to solve the general homogeneous problem, and also contains a 
brief treatment of the Wronskian. The remainder of this section treats equations 
with constant coefficients, concentrating for the most part on the second order 
case, with higher order equations discussed briefly at the end. 

Second order equations with non-constant coefficients are treated in Part III, 


Xill 


XIV Preface 


which covers reduction of order, the method of variation of constants, and series 
solutions. 

Part IV turns aside from differential equations, motivating the study of dif- 
ference equations by discussing Euler’s method of numerical solution. Constant 
coefficient linear difference equations are covered, and then there are two chapters 
devoted to nonlinear difference equations. One of these goes beyond the confines 
of an introductory course and discusses the dynamics of the logistic map in some 
detail. 

Part V treats coupled systems of two linear differential equations, starting with 
the substitution method that reduces the problem to a second order differential 
equation in one variable, the most reliable way to find explicit solutions. The re- 
mainder of this portion of the book deals with the matrix approach, showing how a 
calculation of the eigenvalues and eigenvectors of an appropriate matrix is enough 
to draw the phase portrait. This is done by changing to a coordinate system in 
which the equation is put into a standard form, providing an illustration of the 
Jordan canonical form of a matrix. 

Part VI uses the methods from Part V in order to draw the phase plane diagrams 
for a variety of nonlinear systems, with examples taken from mathematical ecology 
and simple one-dimensional particle systems, including the pendulum. The book 
ends with a brief discussion of Dulac’s criterion and the Poincaré—Bendixson The- 
orem, a chapter that investigates the complicated dynamics of the Lorenz Equa- 
tions, and suggestions for further reading. 

In addition to those already mentioned above I would like to thank various peo- 
ple who have contributed to this book. I first learned much of the material here 
from Tristram Jones-Parry at Westminster School, to whom much belated thanks 
for all his fine teaching many years ago. I also owe a debt of gratitude to all those 
who taught the course at Warwick before me, shaping its contents and therefore 
those of this book; in particular, I had useful guidance from the course notes of 
Alan Newell and Claude Baesens. I am most grateful to Andrew Stuart, who, in 
encouraging me to emphasise the links with linear algebra, made me fond of a 
subject that I still remembered with a shudder from my own undergraduate days. 
Thanks too to James Macdonald, whose ‘Swarm of flies’ program for his MMath 
project on the Lorenz equations was the inspiration behind Figure 37.8. 

Over the past two months I have been able to think of little except phase planes 
and drawing figures in MATLAB: my wife, Tania Styles, has managed to endure my 
many variations on ‘come and see this picture of a washing machine’ with a smile. 
Heartfelt thanks to her for this, and, of course, for everything. 

Finally, I would particularly like to thank my Ph.D. student, Oliver Tearne, and 
my father, John, both of whom read this book extremely carefully and made a num- 
ber of very helpful comments. For whatever imperfections remain, my apologies 
to them and to my readers. 


Introduction 


Differential equations date back to the mid-seventeenth century, when calculus 
was discovered independently by Newton (c. 1665) and Leibniz (c. 1684). Mod- 
ern mathematical physics essentially started with Newton’s Principia (published in 
1687) in which he not only developed the calculus but also presented his three fun- 
damental laws of motion that have made the mathematical modelling of physical 
phenomena possible. ! 

Historically, advances in the theory of differential equations have come from the 
insights gained when trying to treat specific physical models. Despite this some- 
what piecemeal development, the subject has become a well-defined and coherent 
area of mathematics. This book adopts a theoretical point of view, developing the 
theory to the point at which it can no longer be described as ‘basic differential 
equations’ and is about to become entangled with more advanced topics from the 
theory of dynamical systems. Of course, applications are used throughout to serve 
as motivation and illustration, but the emphasis is on a clean presentation of the 
mathematics. 

You may find that some of the problems covered in the first few chapters are 
already familiar. The methods of solving these problems are well established, and 
you may be well practised at applying them. However, we will take care here to 
show why these methods work; giving proper justification of the methods can take 
some time, but as mathematicians we should not be satisfied merely with a set of 
‘recipes’. Nevertheless, knowing something about the details should not stop you 
from applying the methods you know already; rather you should be able to use 
them with more confidence. 

Some of the chapters, and some sections within other chapters, are marked 
with an asterisk (*). These parts of the book contain either material that is more 
advanced, or material that expands on points raised elsewhere; while they could be 
omitted in the interests of brevity, they are intended to give some indication of the 
richness of the subject beyond the confines of an introductory course. 


! Various modern editions of this work are available, translated from its original Latin. 


2 Introduction 


There are three appendices, covering background material that is necessary at 
various points in the book. While some of this is elementary and may already 
be familiar (Appendix A recalls some notation and various facts about real and 
complex numbers that will be used throughout the book) some is a little more ad- 
vanced. Problems with timetabling often mean that certain undergraduate courses 
have to rely on material that is yet to be taught in others, hence there are appen- 
dices on matrices, eigenvalues and eigenvectors (Appendix B) and on derivatives, 
partial derivatives and Taylor series (Appendix C). The calculation of eigenvalues 
and eigenvectors is treated in detail in the main part of the book. 

The use of mathematical computer packages is now a standard part of the under- 
graduate curriculum, and an important tool in the armoury of practising mathemati- 
cians, scientists and engineers. Although the emphasis in the text is on pencil and 
paper analysis, and the book in no way relies on the availability of such software, 
some topics, particularly the treatment of coupled nonlinear equations using phase 
plane ideas in Chapters 28-37, can benefit greatly from the graphical possibilities 
modern computers provide. Almost all of the figures in this book have been gener- 
ated using MATLAB, and very occasionally particular MATLAB commands are men- 
tioned in the text. Nevertheless, it should be possible to carry out the numerical ex- 
ercises suggested here using any of the major commercially available mathematical 
packages; and with a little more ingenuity using any programming language with 
graphical capabilities. The MATLAB files used to produce some of the figures, and 
mentioned in certain of the exercises, are available for download from the web at 
www.cambridge.org/0521533910. 

There is no better way to learn this material than by working through a selection 
of examples. One set of examples is included in what is, I hope, a natural way in 
the text, with the end of each worked solution marked with a box (LJ). Another set 
of examples is given in the exercises that end each chapter, and these should be 
considered an integral part of the book. The majority consist of sample problems 
that can be treated with the methods of the chapter — in order to give teachers a 
reasonable choice of problems, there are intentionally more of these than you could 
reasonably be expected to do. Others, labelled with a “T’, are more theoretical 
and designed to give an indication of some of the mathematical issues raised, but 
not treated in detail, in the text. Finally, those exercises labelled with a ‘C’ are 
intended to encourage the use of the computer to perform routine calculations and 
investigate equations and their solutions graphically. Those involved in teaching 
courses based on this book may obtain copies of solutions to these exercises by 
applying to the publisher by email (solutions@cambridge.org). 

I would welcome any comments or suggestions, either by post to the Mathe- 
matics Institute, University of Warwick, Coventry, CV4 7AL, U.K. or by email 
to jcr@maths.warwick.ac.uk; any errata that arise will be posted on my own 


website www.maths.warwick.ac.uk/~jcr/IntroODEs.html. 


Part I 


First order differential equations 


1 


Radioactive decay and carbon dating 


Before we start our formal treatment of the subject we will look at a very simple 
example that nonetheless exhibits the power of differential equations as models of 
reality. One point to bear in mind in this chapter is the distinction to be made be- 
tween finding the solution of a differential equation, and interpreting this solution. 


1.1 Radioactive decay 


Let N(t) denote the number of radioactive atoms in some sample of material at 
time t. Then with k > 0 the equation 


aa —kN (1.1) 
is a very good model for the way that the number of radioactive atoms decays (see 
Exercise 1.1). 

Although we will see later how to solve this equation, for now we will assume 
that when there are N, isotopes at time s, the solution is 


N(t) = Nye *"-), (1.2) 


You can check that we really do have the solution: when t = s the formula in (1.2) 
gives N(s) = Ns, while we have 


d 
{Ne = —kNse *"—) — —kN (1), 


and so the differential equation (1.1) is satisfied. 

It follows from (1.2) that the number of radioactive isotopes decays exponen- 
tially to zero. Graphs of the solution for various values of N (0), showing this 
decay, are plotted in Figure 1.1. 

The half-life of a particular radioactive isotope is the time it takes for half of the 
radioactive isotopes to decay, and this is related to the constant k that appears in 
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Half-life 


N (millions) 


15 2 2.5 
t (10000 years) 
Fig. 1.1. Graph showing the number N(t) of radioactive atoms falling off as a 
function of time, for a number of different values of No; the constant k is that for 


radioactive carbon 14. The half-life, approximately 5700 years, is marked by a 
dashed vertical line. 


the equation. To find this relationship, suppose that there are No radioactive atoms 
at time t = 0. Then the solution of (1.1) is 


N(t) = Noe™. 
Half of the atoms will have decayed by time fpaig when N (thait) = 5No, Le. 
Noe = No => Atha = 
Taking the (natural) logarithm of both sides gives 
—kthat = —In 2, 


and so the half-life is given by thaig = (n2)/k. Note that this time does not depend 
on the initial number of radioactive atoms. 


1.2 Radiocarbon dating 


The solution (1.2) forms the basis of the technology of radiocarbon dating. The 
essence of the method is as follows. Living matter is constantly taking up carbon 
from the air. The result is that within such material the ratio of the number of iso- 
topes of radioactive carbon 14 (!4C) to the number of isotopes of stable carbon 12 
(!2C) is essentially constant. Once the specimen is dead (for example, a tree is cut 
down for its wood, or cotton is harvested for weaving), the radioactive 14C atoms 
begin to decay according to the model (1.1). Since the half-life of carbon 14 is 


1.2 Radiocarbon dating 7 


approximately 5700 years, we need to take the constant k in (1.1) to be 


In2 
k= 
5700 


By examining the ratio of the number of isotopes of carbon 12 to carbon 14 
in a sample of the material that we want to date, it is possible to work out the 
proportion remaining of the '4C atoms that were initially present. Suppose that the 
sample stopped taking up carbon from the air when t = s, and that the number 
of !*C atoms present then was N,. If we know that the sample now (at time fg) 
contains only a fraction p of the initial level of 4C, then N(to) = DNs. 

Using our explicit solution N(t) = N,e7*"-), we should have 


~ 1.216 x 107+. 


DN, = N(to) = Nye), 


Cancelling the factor of Ns in the two outside terms yields the equation 


p= e Kos) . 


Taking logarithms of both sides we have 
In p = —k(to — 5), 
and so the year s from which the sample dates is given by 


1 
sip. (1.3) 


In 1988, the Shroud of Turin (see Figure 1.2) was dated by three independent 
groups of scientists from Arizona, Oxford and Zurich. Fibres from the shroud were 


Fig. 1.2. The Shroud of Turin: carbon dated to the fourteenth century. Photograph 
© 1978 Barrie M. Schwortz (his website at www. shroud.com is well worth a 
visit). 
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found to contain about 92% of the level in living matter.! Using the expression in 
(1.3) shows that the Shroud therefore dates from 


In 0.92 
s = 1988 + ——___ ®& 1302, 
0.000 121 6 


putting its origin squarely in the Middle Ages. 


1.1 


1.2 


1.3 


1.4 


1.5 


Exercises 


Radioactive isotopes decay at random, with a fixed probability of decay per unit time. 
Over a time interval At, suppose that the probability of any one isotope decaying is 
kAt. If there are N isotopes, how many will decay on average over a time interval At? 
Deduce that 


N(t + At) — N(t) © —NkAt, 


and hence that dN/dt = —kN is an appropriate model for radioactive decay. 
Plutonium 239, virtually non-existent in nature, is one of the radioactive materials used 
in the production of nuclear weapons, and is a by-product of the generation of power 
in a nuclear reactor. Its half-life is approximately 24 000 years. What is the value of k 
that should be used in (1.1) for this isotope? 

In 1947 a large collection of papyrus scrolls, including the oldest known manuscript 
version of portions of the Old Testament, was found in a cave near the Dead Sea; they 
have come to be known as the ‘Dead Sea Scrolls’. The scroll containing the book of 
Isaiah was dated in 1994 using the radiocarbon technique;” it was found to contain 
between 75% and 77% of the initial level of carbon 14. Between which dates was the 
scroll written? 

A large round table hangs on the wall of the castle in Winchester. Many would like 
to believe that this is the Round Table of King Arthur, who (so legend would have it) 
was at the height of his powers in about AD 500. If the table dates from this time, 
what proportion of the original carbon 14 would remain? In 1976 the table was dated 
using the radiocarbon technique, and 91.6% of the original quantity of carbon 14 was 
found.* From when does the table date? 

Radiocarbon dating is an extremely delicate process. Suppose that the percentage of 
carbon 14 remaining is known to lie in the range 0.99p to 1.01 p. What is the range of 
possible dates for the sample? 


! PE. Damon et al., ‘Radiocarbon dating of the Shroud of Turin’, Nature 337 (1989), 611-615. 


2A. 


J. Jull et al., ‘Radiocarbon dating of the scrolls and linen fragments from the Judean Desert’, Radiocarbon 


37 (1995), 11-19. 
3M. Biddle, King Arthur’s Round Table (Boydell Press, 2001). 
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Integration variables 


Because of the intimate relationship between differentiation and integration 
(discussed in more detail in the next chapter) there will be many integrals in this 
book, and it is worth pausing now in order to make sure that we have an appropri- 
ately unambiguous notation. 

Although in theory mathematicians make careful distinctions between ‘the func- 
tion f’ and ‘ f(x)’, the value that f takes at a particular point x, this distinction is 
rarely maintained in day-to-day informal discussions. 

Usually this does not cause any trouble. However, consider the following prob- 
lem, posed in ‘everyday’ language: 


Find the area under the graph of f(x) between a and x. 


Although the meaning of this is clear, ‘find the shaded area in Figure 2.1’, there is 
some potential for confusion when we try to write this down mathematically, since 
there are too many xs around. Converting the English into symbols gives 


ie f(x) dx, (2.1) 


and it should be clear that this is not satisfactory, since the symbol x is used in two 
different ways: once as the upper limit of the range of integration ( [her and once 
as the variable that is being integrated over (dx). 

When we integrate a function between two limits, for example! 


b 
i f(x) dx, 


the variable that we are integrating over is a ‘dummy’ variable. It is just there to 
tell us how to do the integration, and plays no rdle in the final answer, which will 


! Observe that there is no need to change our notation for this particular definite integral, since no confusion can 
arise as to the réle of x. 
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a 


a x 


Fig. 2.1. ‘Find the shaded area’. 


only depend on a and b. So 


b b b 
i: fexydx = f fe)e0 = | f (8) a. 


(We can change the name of the dummy variable with no effect on the integral.) 

The obvious solution, then, is to change the integration variable in (2.1) to some- 
thing other than x. However, changing the variable to something completely dif- 
ferent from x is likely to be confusing. The approach we will adopt will be to add 
a tilde ~ to the integration variable, so that instead of (2.1) we will write 


i : f (&) dk. (2.2) 


All being well this should keep things ‘clean’ but should not be too jarring. 
We will also do something similar when evaluating integrals where x is an upper 


[ t@a=[ra] 
when F’ = f. 


Of course, very few people are this careful when they are doing calculations and 
the backs of mathematicians’ envelopes are full of things like (2.1) rather than the 
pedantic (2.2). 


limit, i.e. 
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Classification of differential equations 


Before we begin we need to introduce a simple classification of differential equa- 
tions which will let us increase the complexity of the problems we consider in a 
systematic way. 


3.1 Ordinary and partial differential equations 


The most significant distinction is between ordinary and partial differential equa- 
tions, and this depends on whether ordinary or partial derivatives occur. 

Partial derivatives cannot occur when there is only one independent variable. 
The independent variables are usually the arguments of the function that we are 
trying to find, e.g. x in f(x), t in x(t), both x and y in G(x, y). The most common 
independent variables we will use are x and ¢, and we will adopt a special short- 
hand for derivatives with respect to these variables: we will use a dot for d/dt, so 
that 


dz daz 
oat a é 
and a prime symbol for d/dx, so that 


d2 
and y= 7 


ya2 
dx 


Usually we will prefer to use time as the independent variable. 
In an ordinary differential equation (ODE) there is only one independent vari- 
able, for example the variable x in the equation 


dy _ 
ee 
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specifying the slope of the graph of the function y; the variable ¢ in 
mx = f(t) 


which we could solve for the position x(t) = (x(t), y(t), z(t)) of a particle at 
time ¢ moving under the action of a force f(t) (the equation is Newton’s second 
law of motion, F = ma); or x in 


nh? ey 


= Sin dee + Vixw => Ew 


where w(x) = a(x) +i6(x) is complex (this is the Schrddinger equation from 
quantum mechanics). 

In a partial differential equation there is more than one independent variable 
and the derivatives are therefore partial derivatives, for example the heat in a rod 
at position x and time ft, h(x, t), obeys the heat equation 


ah 07h 


Ot Ox2° 


A much more complicated example is given by the Navier-Stokes equations used 
to determine the velocity of a fluid 


WR 895 3) (1 Hs X02, 43, Ty 09 KD, HS BY, UA, 02) 89,2) 


(think of x1 = x, x2 = y, and x3 = z), which are:! 


Ou; : Ou; 07; 07u; 07u; op 
y . = =f;. Gl 
a OX; . Ot: . ax5 ss xz e OX j Li EY 


(one for each component, j = 1, 2,3) and 


du, Ou2  du3 
= 0. 3.2 
Ox - 0x2 4 0x3 Ce) 


In this book we will consider only ordinary differential equations. 


! Ttis possible to write these two equations much more concisely using vector calculus notation. Imagine that V 
represents a vector of partial derivatives, V = (0/0x1, 0/0x2, 0/0x3), which can be manipulated like a normal 
vector. Then, for example, Equation (3.2) is just V-u = 0. Defining also A = V - V (the sum of all second 
derivatives) we can rewrite (3.1) as 


3 
| St + (a: Vu] ~ nau vp=e (3.3) 
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3.2 The order of a differential equation 

The order of a differential equation is the highest order derivative that occurs: the 
equation 

dy 

ae f(x) 
specifying the slope of a graph is first order, as is the following equation expressing 
energy conservation, 

smi? + V(x) =E 


(Zmx? is the kinetic energy while V(x) is the potential energy at a point x); 
Newton’s second law of motion 


is second order; the equation 
yi" 4 xv yw’ —() 


(which occurs in the theory of fluid boundary layers) is third order (recall that yy” 
is shorthand for d?y/dx?). 

To be more formal, an nth order ordinary differential equation for a function 
y(t) is an equation of the form 


d*y d'-! d 
F( et Oe 2, yt) 0. (3.4) 


dt”? dt?-1’ dd 


(Of course we want d” y/dt” to occur in F; if F(, y, y, tf) is y — ¢ then the result- 
ing equation (y — t = 0) is not a differential equation at all.) If ¢ does not occur 
explicitly in the equation, as in 


then the equation is said to be autonomous. 


3.3 Linear and nonlinear 


Another important concept in the classification of differential equations is linearity. 
Generally, linear problems are relatively ‘easy’ (which means that we can find an 
explicit solution) and nonlinear problems are ‘hard’ (which means that we cannot 
solve them explicitly except in very particular cases). 


14 3 Classification of differential equations 
An nth order ODE for y(t) is said to be linear if it can be written in the form 


d"y qr-l 
Onl) + an—\(t) 


Ye taM2 +aMy= FO, 65) 
drn-! dt 


i.e. only multiples of y and its derivatives occur. Such a linear equation is called 
homogeneous if f (t) = 0, and inhomogeneous if f (t) 4 0. 


3.4 Different types of solution 


When we try to solve a differential equation we may obtain various possible types 
of solution, depending on the equation. Ideally, perhaps, we would find a fully ex- 
plicit solution, in which the dependent variable is given explicitly as a combination 
of elementary functions of the independent variable, as in 


y(t) = 3cos 5t + 8 sint. (3.6) 


We can expect to be able to find such a fully explicit solution only for a very limited 
set of examples. 

A little more likely is a solution in which y is still given directly as a function 
of t, but as an expression involving an integral, for example 


t 
rie one [ e-* ds. (3.7) 


Here y is still an explicit function of ¢, but the integral cannot be evaluated in terms 
of elementary functions. 

Sometimes, however, we will only be able to obtain an implicit form of the 
solution; this is when we obtain an equation that involves no derivatives and relates 
the dependent and independent variables.* For example, the equation 


Iny+4Inx —-y—2x+4=0 (3.8) 


relates x and y, but cannot be solved explicitly for y as a function of x. 

All these types of solution will occur in what follows. 

There are many situations, however, in which it is not possible to obtain any use- 
ful expression for the solution. For some equations it is still possible to understand 


2 We could also have an implicit solution containing integrals that cannot be evaluated in terms of elementary 
functions. For example, we will see that the equation dx/dt = f(x)g(t) has solution 


dx 
| fey = [20 


which in general gives such an implicit solution. 
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0 0.5 1 1.5 2 2.5 


Fig. 3.1. A qualitative, graphical solution of the coupled system of equations 
(3.9). The axes are x (horizontally) and y (vertically), and it is safe to assume 
that this is the case for any unlabelled axes in the rest of the book. 


the qualitative behaviour of the solutions, i.e. to describe how the solutions be- 
have, even though we cannot specify them exactly. This is the approach we will 
take in Chapter 7, and throughout Chapters 32-37. Such a description is often best 
expressed graphically. For example, Figure 3.1 shows the phase diagram (or phase 
portrait) for the solutions of the equations 
x=x(4—-—2x —y) 
y = y9 — 3x —3y). 
The diagram is a plot of sample curves traced out by solutions (x(t), y(t)) labelled 
with arrows indicating the direction in which ¢ increases. The crosses show points 
at which the solutions of this equation are constant. We can tell from this dia- 
gram that every solution eventually approaches the point (1, 2) [i.e. x(t) — 1 and 
y(t) > 2 as t — +00], even though we do not have any form of explicit solution 
for (3.9). 
For some equations all our analytical tools may fail, and in this case we can 


(3.9) 


often use a computer to approximate the solution. A ‘numerical solution’ of a 
differential equation is usually only an approximation, and the initial result of such 
a calculation will not be an expression for x in terms of f, say, but a list of times, 
t, and corresponding approximate values for x(t). Using MATLAB’s ODE solving 
routine, ode45, to solve the equation 
dx 9 


—=t-x 


4; x(0) =0 
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between times t = 0 and t = 5, yields such a list: 


>> xdot=inline(’t-x*2’',’t’,’x’); 

>> [t x]=ode45(xdot,[0 5],0); 

>> Pex] 

ans = 0 0 
0.1250 0.0078 2.6250 1.4921 
0.2500 0.0312 2.7500 1.5407 
0.3750 0.0700 2.8750 1.5864 
0.5000 0.1235 3.0000 1.6299 
0.6250 0.1907 3.1250 1.6721 
0.7500 0.2700 3.2500 1/127 
0.8750 0.3591 3:.3:750 Lf 545 
1.0000 0.4555 3.5000 1.7891 
1.1250 0.5563 3.6250 1.8261 
1.2500 0.6585 3.7500 1.8621 
1 3750 0.7596 3.8750 1.8969 
1.5000 0.8574 4.0000 1.9310 
1.6250 0.9505 4.1250 1.9646 
1.7500 1.0377 4.2500 1.9976 
1.8750 1187 4.3750 2.0297 
2.0000 71935 4.5000 2.0612 
2.1250 1.2628 4.6250 2.0925 
2.2500 1.3268 4.7500 2.1231 
2.3750 1.3856 4.8750 23153:1, 
2.5000 1.4403 5.0000 2.1826 


We will discuss one simple method of numerical approximation in Chapter 21. 


Exercises 


3.1 Classify the following equations as ordinary or partial, give their order, and state 
whether they are linear or nonlinear. In each case identify the dependent and inde- 
pendent variables. 

(i) Bessel’s equation (v is a parameter) 


xy" + xy! + (x? — v*)y =0, 
(ii) Burger’s equation (v is a parameter) 


ou a7u ou 
v = 
ot ax2 Ox 
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(iii) van der Pol’s equation (m, k, a and b are parameters) 
mi + kx = ax — bx?, 

(iv) dy/dt =t— y’, 

(v) the wave equation (c is a parameter) 


Py Ady 
a Ax?’ 
(vi) Newton’s law of cooling (k is a parameter and A(t) is a specified function) 
i —k(T — A(t)), 
dt 


(vii) the logistic population model (k is a parameter) 


dp 
T_T = kp 1 = Pp J 


(viii) Newton’s second law for a particle of mass m moving in a potential V(x), 
mx = —V'(x), 
(ix) the coupled equations in (3.9) 
x =x(4-—2x—-y) 
y = yO — 3x — 3y), 
and 
(x) 


d 
eis = Ax, 
dt 


where x is an n-component vector and A is ann x n matrix. 


4 


*Graphical representation of solutions using MATLAB 


The list of numbers that formed the example of a numerical solution at the end 
of the previous chapter indicates how useful a graphical representation of solu- 
tions can be. In fact MATLAB’s default presentation of a numerical solution of a 
differential equation is as a graph: the commands 


>> xdot=inline(’t-x*2’, ‘t’, 'x’); 
>> ode45(xdot, [0 5], 0) 


produce the graph shown in Figure 4.1 (only the axis labels have been added). 

Whichever kind of solution we manage to obtain for our equation, the graph- 
ical capabilities provided by modern computer packages enable us to visualise 
these solutions and so obtain a much better understanding of their behaviour. All 
the solutions in Section 3.4 benefit from a graphical presentation. In this section 
we briefly discuss the main MATLAB commands that can be used to visualise and 
solve a variety of equations. 

Almost all of the figures in Parts I, II, and III of this book are the graphs of 
explicit solutions; these are very easy to produce with MATLAB. For example, to 
plot y(t) = 3cos5t + 8sin¢ against ¢t for 0 < t < 20, the three lines 


t=linspace(0,20); 
y=3*cos(5*t)+8*sin(t); 


plot(t,y) 


produce Figure 4.2. 
If the solution is given as an integral that cannot be evaluated explicitly, like 
(3.7), 


t 
y(t) =1+ / e ds, 
0 
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25 


Fig. 4.1. The solution of x =f—x* with x(0)=0, as produced by the 
MATLAB ode45 command. The individual pairs (x, f) are represented by the 


circles, and are joined to produce an approximation to the solution x(t) of 
the original equations. 


N\) Woof 
| a | N | 


Fig. 4.2. The graph of y(t) = 3 cos 5t + 8 sint (y against f). 


then we can find the value of y at any given value of ¢ by approximating the in- 
tegral; this is something that computers are very good at. The integral of et 
between 0 and 2 (for example) can be evaluated by defining an ‘inline function’ 
f@O= exp(—t*) and then using the quad command: 


>> f=inline(’exp(-t.*2)’,’t’) 


f = Inline function: 


f(t) = exp(-t.%*2) 


20 4 *Graphical representation of solutions using MATLAB 


Fig. 4.3. The graph of y(t) = 1+ f e-* ds. 
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Fig. 4.4. The curve Iny + 4Inx — y—2x+4=0. 
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>> quad(f,0,2) 


ans = 0.8821 
The graph of y against ¢ can be plotted using the short u-file 


f=inline(’exp(-t.*2)’,’t’); 
for i=0:30 
t(i+1)=1/10; 
y(i+1)=1+quad(f£,0,t(it1)); 

end 

plot(t,y) 


Exercises 
Given an implicit formula like (3.8), 
Iny+4Inx — y—2x = —4, 
we can notice that x and y lie on a curve that makes 
F(x, y) =Iny+4lInx — y —2x 
constant. The ‘contour plot’ of the level set F(x, y) = —4, 
>> [x, y]=meshgrid(.01:.1:5, .01:.1:5); 
>> z=log(y)+4*log(x) -y-2*x; 
>> contour(x,y,z,[-4 -4]) 
is shown in Figure 4.4. 


Exercises 


4.1 (C) Plot the graphs of the following functions: 
Gi) y(t) = sin 5¢ sin 50t for 0 < t < 3, 
(ii) x(t) = e ‘(cos 2t + sin 2t) forO < t <5, 
(iii) 


t 
T(t) = / e"— sinsds for O<t<7, 
0 
(iv) x(t) =tInt forO<t <5, 
(v) plot y against x, where 
x(t) = Be ' + Ate‘ and y(t) = Ae“, 


for A and B taking integer values between —3 and 3. 
4.2 (C) Draw contour plots of the following functions: 


(i) 
F(x, y)=x°+y’ for —-2<x,y <2; 
(ii) 
F(x, y) =xy* for -l<x,y<l, 
with contour lines where F = -£0.1, +0.2, +0.4, and +0.8; 
(iii) 
E(x, y) = y’ —2cosx for -4<x,y<4 
(iv) 
E(x,y)=x- 5x? + sy7(x4 — 2x? +2) 
for —2 <x <4 and —2 < y < 2, showing contour lines where 
0.8, 1, 2,3 and 4; 
(v) 


EQ, y=y +x -—x for —-2<x,y <2. 


21 


E=0,0.5, 
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‘Trivial’ differential equations 


In this chapter we consider the simplest possible kind of differential equation, 
one that can be solved directly by integration. Although the problem is relatively 
straightforward, it will serve to introduce several important ideas. 

You have probably already met and solved one simple kind of differential equa- 
tion: 


dy 
a = f(x). G1) 
XxX 


Viewed as an equation to solve for y(x), this asks us to find the function whose 
graph has slope f(x) at the point x. So in order to solve this equation we ‘just’ 
have to find a function whose derivative is f (x). 


5.1 The Fundamental Theorem of Calculus 


Any function F that satisfies F’ = f is called an anti-derivative! of f. Clearly if 
F is an anti-derivative of f then so is F(x) + c for any constant c. 

This terminology allows us to distinguish between reversing the process of dif- 
ferentiation (finding an anti-derivative) and integration (finding the area under a 
curve). Put like this it becomes possible, perhaps, to appreciate how remarkable 
it is that these two concepts are so intimately related. This is formalised in the 
Fundamental Theorem of Calculus (FTC). 

Essentially this theorem says that differentiation reverses the action of integra- 
tion, and that if we know an anti-derivative of f we can calculate the area under 
the graph of f between any two points; it is easy to forget that the FTC is a major 
result because we use it so frequently in order to calculate integrals. 

In the statement of the theorem we use R to denote the set of all real numbers, 
and [a, b] denotes the closed interval a < x < b. 


! The more puzzling word ‘primitive’ is sometimes used instead of ‘anti-derivative’. 
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G(x) 


a x 


Fig. 5.1. G(x) is the area under the graph of f between a and x. 


Theorem 5.1 Suppose that f : [a,b] > R is continuous, and for a<x <b 
define 


G(x) = is f(x) dx (5.2) 


(the integral G(x) is the area under the graph of f between a and x, see 
Figure 5.1). Then 


<0 = f(), 


and furthermore 


b 
i f(x) dx = F(b) — F(a) (5.3) 


for any anti-derivative F of f (i.e. for any F with F' = f). 


We often write (5.3) in the more convenient shorthand 


b b 
i: f(x)dx = | Fe | i (5.4) 


x=a 


Proof (Sketch) If we calculate dG/dx using the formal definition of the derivative 
as a limit (see Appendix C), 


at hs G(x + 6x) — EQ) (5.5) 


8x0 ox 
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ye cease f(x) 


G(x+5x)—G(x) 


0 


X X+5xX 


Fig. 5.2. f is essentially constant on the narrow strip [x, x + 8x]. 


then we get 


dG oe 1 x+8x ave x ae 
S00) = sim, = (f" remar- [” pear) 


1 x+6x 
= lim ~| f (®) dk. 


8x0 OX 


The expression 


x+8x 
i} f (&) dk 


(5.6) 


represents the area in the little strip between x and x + 8x (see Figure 5.2). Since 
f(x) © f(x) for this range of x the value of (5.6) is roughly 5x f(x), and so we 


get 


1 
G'(x) © lim, 5, 8 FO) = £00: 


in other words G(x) is an anti-derivative of f(x). This argument can be made 


precise if f is continuous (see Exercise 5.9). 


We now show how we can use an anti-derivative in order to calculate a definite 
integral (between two fixed limits) as in (5.3). If F is any anti-derivative of f then 
(d/dx)(F — G) = F’ — G’ = Oand so F and G can only differ by a constant, 


F(x) = G(x) +c. 
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Since G(a) = 0 (from its definition in (5.2)) we have F(a) = c, and so 


b 
i F(x)dx =G0) 
= F(b)—-c 
— F(b) - F(a), 


which is (5.3). 


Because of the relationship between anti-derivatives and integrals, the notation 


. f(x) dx, (5.7) 


(note the lack of limits on the integral) is often used as a shorthand to mean ‘an 
anti-derivative of f’. We will use this notation at times, but when we need to be 
more careful we will explicitly use a particular choice of anti-derivative F(x). 


5.2 General solutions and initial conditions 


Now let us return to our simple differential equation” 


d 


ies 
qe (5.8) 


Any anti-derivative F of f is a solution of this equation (y(x) = F(x)); hence we 
could simply write 


ree [ fooen. 


If we choose one particular anti-derivative F’, then we know that not only is 
y(x) = F(x) a solution, but also y(x) = F(x) + c for any c. So as it stands (5.8) 
has many solutions. We say that 


yo) = Qc (5.9) 


is the general solution of the equation (5.8), since any possible solution of (5.8) 
can be obtained by choosing c appropriately. It should not be a surprise that there 
are many possible solutions; we can move a graph ‘up and down’ and not change 
its slope — all the curves in Figure 5.3, which differ by only a constant, have the 
same slope at any given x value. 

As illustrated in the figure, one way to pick out a particular solution is to give a 
point that must lie on the graph of y, in other words to specify the value y(xo) of 


2 Usually we will not make explicit the dependence of dy/dx on x (and similarly for other derivatives). 
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(XyYo) 


Fig. 5.3. Adjusting the constant c in (5.9) corresponds to moving the graph of F 
‘up and down’ and does not affect the slope. An initial condition (xo, yo) will pick 
out one of the curves. 


y at some particular x value, x9. We refer to such a restriction 


y(xo) = yo 


as an initial condition, the idea being that we could construct the solution of (5.8) 
starting at (xo, yo) and then drawing the graph by using the information about the 
derivative of y contained in (5.8). 

There are two ways to find the solution of (5.8) that satisfies y(x9) = yo. You 
should make sure that you understand what follows, since we will use similar 
reasoning very often throughout the rest of the book. 

For the first method we do a little more than we have to: we find the general 
solution, and then solve a very simple algebraic equation to find the correct con- 
stant. We have seen that if we can find one anti-derivative F of f then the general 
solution of (5.8) is 


y(x) = F(x) +c. 
The particular solution that we want has y(xo) = yo, and so we need 
yo = y(xo) = Fo) + => c = yo — F(xo). 
Thus the solution with y(x9) = yo is 
y(x) = yo + F(x) — F(xo). (5.10) 
The alternative is to proceed more directly, and integrate both sides of 


dy 
a f (x) (5.11) 
X 
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between xo and x. Then we get 


“dy 2 oo Fist ea 
/ =a ai = | f(x) dx, 
x0 dx xo 
pm] =f reman 

X=XQ x0 


Putting in the limits on the left-hand side this is 


which gives 


Cenc / FS) ak. (5.12) 
X0 


You should make sure that you are happy going straight from (5.11) to (5.12); we 
will generally skip the two intermediate steps. 
Since y(xo) = yo, (5.12) gives the solution in the form 


y(x) = yo +/ f(x) dx. (5.13) 
x0 


(The FTC shows that we do indeed have y’(x) = f(x), and clearly y(xo) = yo as 
required.) If we know that F is an anti-derivative of f then we can use the FTC, 


b 
/ f(x) dx = F(b) — F(a), 


(this was (5.3), and is just the usual rule for evaluating integrals) to write the solu- 
tion more explicitly as 


y(x) = yo t+ F(x) — F(xo). 


Of course, this is the same expression that we obtained above in (5.10). Note, 
however, that in some cases the integral form in (5.13) may be the best that we can 
do, if it is not possible to find an explicit anti-derivative of f. 

We now look at some simple examples. 


Example 5.2 Find the general solution of the equation 


dy ; 
—=x+ 10sinx. (5.14) 
dx 


What is the equation of the graph with slope x + 10 sin x passing through the point 
(tr, 0)? 


In order to find the general solution we nee to find an anti-derivative of 
x + 10sinx. Using standard integrals this is 5x? — 10cos.x, and so the general 
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él (11,0) 


20 - 


25 
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0 1.5708 3.1416 4.7124 6.2832 


Fig. 5.4. The graph of y(x) = 5 (x? — 7m) — 10(1 + cos x). It passes through the 
initial condition (a, 0), which is marked by a cross. 


solution of (5.14) is 
y(x) = 5x? — 10cosx +c 
for any c. The one solution that passes through (77, 0) must have 
O=ym=pm—--lWt+ce = e=—zm—10, 
and so 
y(x) = 5 (x? — m*) — 10(1 + cosx). 


The graph of y against x is shown in Figure 5.4, along with the initial condition. 


Example 5.3 A curve passing through the point (1, 0) has slope Inx. What is the 
equation of the curve? 


We have to solve the equation 


dy 1 
—=Inx. 
dx - 


Since we have already used the ‘long-winded’ method in the first example, let us 
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4.57 
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Fig. 5.5. The graph of y(x) = 1+.xInx — x, with the initial condition (1, 0) 
marked by a cross. 


do this one directly. We integrate both sides of the equation between 1 and x to 
give 


y(x) — yd) = / Inx dx 
1 
= [sins -3| 
xK=1 
= (x Inx — x) —-(-1) 
and so (since we want y(1) = 0) 


y(x) =14+xInx —x. 


This solution is shown in Figure 5.5. 
In the next section we will give some more examples, this time more practically 
based, using Newton’s second law of motion (F = ma). 


5.3 Velocity, acceleration and Newton’s second law of motion 


Newton formulated the calculus, and his theory of differential equations, in order 
to be able to write down and solve the mathematical models that resulted from 
his laws of motion. Since derivatives are essentially the ‘rate of change’, questions 
concerning velocities (the rate of change of position) and acceleration (the rate of 
change of velocity) are most naturally framed as differential equations. 
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Newton’s second law of motion states that the change Ap in the momentum p 
of an object is equal to F,, the force applied, multiplied by the time At over which 
the force acts, 


Ap = F At. 


Dividing by At and letting At tend to zero (this is, of course, a somewhat imprecise 
derivation) we obtain 
dp 
— = F(t). 
a (t) 
Since the momentum is the mass m times the velocity v, i.e. p = mu, if the mass 
is constant we obtain 
dp d du 
— = —(mv) = m— = Fit). 
dt dt my) dt ©) 
The rate of change of v, du/dt, is precisely what we mean by the acceleration, and 
so this equation is the familiar formula “F = ma’ written another way. 


Example 5.4 A car of mass m is travelling at a speed vg when it suddenly has to 
brake; the brakes apply a constant force k until the car comes to rest. How long 
does it take the car to stop, and how far does it travel before it comes to rest? 


Using Newton’s second law we have 


du 


m—_ = ’ 
dt 
since the force acts to oppose the motion of the car. Rewriting this as v = —k/m 
and integrating both sides between times 0 and ¢ we get 


Ee es 
veo-w=- f — dt, 
om 


or 
kt 
1) VS, 
m 
and so 
kt 


v(t) = v0- —; 
m 


see Figure 5.6. The car stops when v(f,) = 0; this implies that t, = mvo/k. 
Since the velocity v is the time derivative of the position x, v = x and we have 
kt 


dx 
— =v(t) =u - —. 
a v(t) = v9 - 
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Fig. 5.6. The speed of a car that suddenly brakes: v(t) = vo — (kt/m), with 


m = 1000 kg and k = 6500 N and for initial speeds of 30 mph (solid line) and 
35 mph (dashed line). 
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Fig. 5.7. The distanced moved by the car once the brakes have been applied. The 
choice of k and m is as for Figure 5.6, and again the solid line is for an initial 
speed of 30 mph and the dashed line is for an initial speed of 35 mph. 


Integrating both sides between ¢ = 0 and f = f, (when the car stops) we get 


=a) = [( 
0 


Substituting for tf, we have 


as shown in Figure 5.7. 
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Since the stopping distance is proportional to the square of the speed, relatively 
small increases in speed will have a marked effect on the stopping distance. The 
stopping distance for a car travelling at 35 mph will be 49/36 of that for a car 
travelling at 30 mph, almost half as much again for just 5 mph extra speed,’ see 
Figure 5.11. 


5.4 An equation that we cannot solve explicitly 


We remarked above that there are many cases in which the best that we can do 
is to find the solution in the form of an integral, as in (5.13) where we wrote the 
solution of the general equation 


dy 
ae f(x) with y(xo) = yo 
X 


as 
JO=sre i. rears (5.15) 
x0 


However, it can still be possible to describe qualitatively the behaviour of the 
solution. Here we consider a simple example, 


dx 2 

—=e 0) = xo. 

a x(0) = x0 
Integrating both sides between times 0 and f¢ gives the solution 


t 


x(t) = x0 +/ art dt (5.16) 
0 


This is as far as we can go without resorting to approximation, since there is no 
explicit form for the anti-derivative of e~" . 
However, it is known’ that 


PO 05 
/ SP deal? 
0 


w 


Realistic values of m and k are m = 1000 kg and k = 6500 N (one newton is one kg m/s) which means that 
the stopping distances at 30 mph (* 13.4 m/s) and 35 mph (* 15.6 m/s) are 13.8 m and 18.7 m respectively (to 
fall prey completely to the British imperial/metric confusion, that is roughly 40 feet and 60 feet respectively). 
An extra 10 mph on the motorway means that the stopping distance at 80 mph is 64/49 of the stopping distance 
at 70 mph, over a quarter as much again: 225 feet (© 75 m) at 70 mph (© 31 m/s) and 300 feet (~ 100 m) at 
80 mph (* 36 m/s). 

This ‘Gaussian integral’ arises frequently, making it very frustrating that it cannot be evaluated explicitly. In 
particular the normal distribution, which is fundamental in the theory of statistics, is described by a bell-shaped 
curve whose equation is e* /2 //27; statistical tables for the normal distribution are essentially based on 
evaluating the integral in (5.16) numerically. 
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Fig. 5.8. We cannot solve the equation explicitly, but we know that the solution 
always increases and tends to a value \/7/2 greater than its initial value. 


: : 2% : : 
So we can say, since x =e" is always strictly greater than zero, that x(t) in- 


creases as ¢ increases, and that 


x(t) > x9 + J 7/2 


as t —> ov, see Figure 5.8. Even though we cannot write down an explicit form for 
the solution, we can still say exactly what happens ‘eventually’. In this way we 
can still understand something about the behaviour of the solution. This ‘eventual’ 
behaviour is often referred to as the long-time, or time asymptotic, behaviour. 


Exercises 


5.1 Find the general solution of the following differential equations, and in each case find 
the particular solution that passes through the origin. 


(i) 


dé : 
— = sint+cosf, 
dt 


(ii) 


(use partial fractions) 
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(iii) 
dU 
— = 4f Int, 
dt 
(iv) 
d 
Sf ge : 
dx 
and 
(v) 
dT ee 
— =e sin2t. 
dt 
Find the function f(x) defined for —t/2 < x < m/2 whose graph passes through the 


point (0, 2) and has slope —tan x. 

Find the function g(x) defined for x > —1 that has slope In(1 + x) and passes through 

the origin. 

Find the solutions of the following equations satisfying the given initial conditions: 
(i) 


x=sec’?t with x(m/4) =0, 


(ii) 
ys=x—7x° with = y(-1) = 1, 
(iii) 
do ae 
ane 2 sin’ t with 6(m/4) = 7/4, 
(iv) 
dV 
x—=14+x* with V(d)<=1, 
dx 
and 
(v) 


d 
= [ze*] =e with x0) =3, 


The Navier-Stokes equations that govern fluid flow were given as an example in Chap- 
ter 3 (see equations (3.1) and (3.2)). It is not possible to find explicit solutions of these 
equations in general. However, in certain cases the equations reduce to something 
much simpler. 

Suppose that a fluid is flowing down a pipe that has a circular cross-section of radius 
a. Assuming that the velocity V of the fluid depends only on its distance from the 
centre of the pipe, the equation satisfied by V is 


ld dV 
r =—-P, 
r dr dr 


where P is a positive constant. 
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Fig. 5.9. The quadratic velocity profile in a circular pipe. 


Multiply by 7 and integrate once to show that 


where c is an arbitrary constant. Integrate again to find an expression for the velocity, 
and then use the facts that (i) the velocity should be finite at all points in the pipe and 
(ii) that fluids ‘stick’ to boundaries (which means that V (a) = 0) to show that 


Vir) = 7 @ _ Py, 


see Figure 5.9. (This is known as Poiseuille flow.) 
5.6 An apple of mass m falls from a height h above the ground. Neglecting air resistance 
its velocity satisfies 


dv 


te = —mg v(0) = 0, 


where v = y and y is the height above ground level. Show that the apple hits the 
ground when 


5.7 An artillery shell is fired from a gun, leaving the muzzle with velocity V. If the gun is at 
an angle @ to the horizontal then the initial horizontal velocity is V cos 6, and the initial 
vertical velocity is V sin 6 (see Figure 5.10). The horizontal velocity remains constant, 
but the vertical velocity is affected by gravity, and obeys the equation v = —g. How far 


Fig. 5.10. Firing a shell at muzzle velocity V at an angle 6 to the horizontal. The 
shell follows a parabolic path. 


At 5mph over the 30mph limit, 
how much further does it take to stop? 


Twenty one feet, that’s over six metres. 
At 30mph, you would have stopped in front of the first child. 


At 35mph, you would not only hit the first child, but you would still be 
doing 17mph when you hit the second child. 


It doesn’t matter how good a driver you are. The faster you go, the longer 
it takes to stop. In this case the extra Smph means the total stopping 
distance of an average saloon with ABS is an agonising ninety six feet, or 
twenty nine metres. Which brings you to the feet of the third child. 


30mph might feel slow, but only until someone steps out in front of you. 


THR 


Slow down. 


Fig. 5.11. A recent UK campaign to persuade drivers to cut their speed in town 
from 35 mph to 30 mph. The film at www. thinkroadsafety.gov.uk/ 
slowdown/download/slowdown.mpg makes the point more forcefully. 


5.8 


5.9 
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does the shell travel before it hits the ground? (Give your answer in terms of V 
and 6.) 

In Dallas on 22 November 1963, President Kennedy was assassinated; by Lee 
Harvey Oswald if you do not believe any of the conspiracy theories. Oswald fired 
a Mannlicher—Carcano rifle from approximately 90 m away. The sight on Oswald’s 
rifle was less than ideal; if the bullet travelled in a straight line after leaving the rifle 
(at a velocity of roughly 700 m/s) then the sight aimed about 10 cm too high at a target 
90 m away. How much would the drop in the trajectory due to gravity compensate for 
this? (The initial vertical velocity v is zero, and satisfies the equation v = —g, while 
the horizontal velocity is constant if we neglect air resistance.) 

(T) This exercise fills in the gaps in the proof of the Fundamental Theorem of Calculus. 
Suppose that f is continuous at x, i.e. given any € > 0, there exists a bd = d(€) such 
that 


|Ix—x| <6 > lf@) — f@)| <e. 
By writing 
1 a+ox 
f@=e | fx) di 
X Sx 


show that for all 6x with |dx| < 6(e) 


1 x+dx 
yoo =| fwd se, 
bx Jy 
and hence that 
1 x+6x 
lim — x) dx = : 
aan al f(x) dit = f(x) 


You will need to use the fact that 


b 
i g(x) dx 


b 
< | eG 20 ay ae EO). 
a xeé[a,b] 
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Existence and uniqueness of solutions 


Because we are going to spend some time trying to solve equations like 


= = f(x,t) (6.1) 
dt 
we need to be sure that such equations will actually have solutions. Clearly it is a 
hopeless task to search for a solution of (6.1) if the solution does not exist (hunting 


for a unicorn will take you a very long time). 


6.1 The case for an abstract result 


It is quite easy to write down equations that do not have any solutions, for example 


>, dx 
x-+t oa 0 x(0) =c 
does not have any solutions if c 4 0: if t = 0 then the second term of the differen- 
tial equation disappears and we must have x (0) = 0. 

We have already seen that there are many possible solutions when we want to 
find a function whose graph has a particular slope; the question of the uniqueness 
of solutions of a differential equation is somewhat subtle. However, we saw that by 
specifying a particular initial condition we could tie down one particular solution. 
So the problem that we will consider for our general theory will be the initial value 
problem (IVP), consisting of the differential equation supplemented by an initial 
condition, 


dx 
ap = f(x,t) = x(to) = Xo. (6.2) 
If we suppose for a moment that the independent variable is time, then the re- 
quirement of uniqueness has a physical interpretation. Suppose that we specify an 


initial condition ‘now’, i.e. at time ¢ = 0; then the existence of a unique solution 
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means that we can use the equation to predict the future, since the solution is 
uniquely determined for t > 0. In this context, uniqueness of solutions is equiva- 
lent to the requirement that our model be deterministic. 

Uniqueness is also useful since occasionally we may be able to guess what the 
solution of an equation is. If we substitute this guess in and it works, then it must in 
fact be the solution since we know that there is no other. We have already used this 
implicitly in Chapter 1 when we just checked that our solution N(t) = N,e*¢-%) 
worked, and then assumed that it must be the only solution. 

As with existence, uniqueness is not automatic. The innocuous looking [IVP 


dx/dt= Vx  x(0) =0 (6.3) 


has an infinite number of solutions. The ‘obvious’ solution is x(t) = 0 for allt > 0. 
But if you choose any value of c > 0, the function 


0 t<c 


Xe(t) = fe —c)*/4 t>c 


also satisfies the equation. Here the solution ‘waits around’ at x = 0, before even- 
tually ‘deciding’ (at time t = c) to wander off slowly to infinity. Some of the solu- 
tions of (6.3) are shown in Figure 6.1. 

The issues of existence and uniqueness are real, and it is possible to come up 
with very simple equations in which they fail. The good news is that there is a very 
general theorem guaranteeing existence and uniqueness, with a hypothesis which 
is very simple to check. 


0.57 


Fig. 6.1. A number of solutions of equation (6.3). 
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6.2 The existence and uniqueness theorem 


The proof of the general existence and uniqueness theorem is beyond the scope 
of this book, and we will just state the result. However, if you are interested 
Exercise 6.4 leads you through an outline version of the proof. 

In order to state the theorem properly we need to have a more precise idea of 
what we mean by a ‘solution’ of the initial value problem (6.2). The main point 
of the definition is that we allow for a solution to be defined only for some interval 
of t values, and do not require it to be defined for every ¢ € R. 


Definition 6.1 Given an open interval I that contains to, a solution of the initial 
value problem 


ew = f(x,t) with X(to) = xo (6.4) 


on I is a differentiable function x(t) defined on I, with x(to) = x9 and x(t) = 
f(@,t) forallt € I. 


The way that the definition specifies the interval on which the solution exists 
(rather than insisting that it be defined for every value of t € R) may seem pedantic 
at first, but we will soon see that this is necessary even for some very simple 
equations, since it is possible for the solution to ‘blow up’ in a finite time. 

But for now, given our formal definition of a solution, we can state the existence 
and uniqueness theorem. ! 


Theorem 6.2 Jf f(x,t) and df/0x(x, t) are continuous for a < x <b and for 
c <t <d then for any xo € (a,b) and tg € (c, d) the initial value problem (6.4) 
has a unique solution on some open interval I containing to. 


Essentially the result says that if the function f(x, f) is ‘sufficiently nice’ then 
the equation will have a unique solution, at least close to t = fo (see Figure 6.2). 
However, the result tells us nothing about how large the interval is on which the 
solution can be defined. 

In almost all of the examples we meet, f will be ‘sufficiently nice’; but we have 
already seen one simple example in (6.3) for which there is no uniqueness. This 
does not contradict Theorem 6.2, since the derivative of x!/ is infinite at x = 0: 
when f(x) = x!/?, we have 7 )= 5x2, and this becomes infinite as x | 0, 
so f’ is certainly not continuous at x = 0. 


! Im fact the conditions on f in the theorem are a little stronger than they need to be. It is only necessary that the 
function f is a Lipschitz continuous function of x, which means that 


Ifa.o-fO,0l < Llx— yl (6.5) 


for some constant L. Any function with continuous first derivative is Lipschitz continuous (see Exercise 6.2), 
but not every function that is Lipschitz continuous has continuous first derivative (e.g. f(x) = |x|). 
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Fig. 6.2. Given an initial condition x (to) = xo, the existence and uniqueness the- 
orem only guarantees the existence of a solution defined on some open interval 
(marked by the bold line on the horizontal axis) containing the initial time fg. 


Fig. 6.3. The derivative of x!/*, plotted here against x, is not continuous at zero, 
where it is infinite. 


6.3 Maximal interval of existence 


We now give an example showing that we need the freedom to specify the inter- 


val on which the solution of an equation exists if we want a result as general as 
Theorem 6.2: 


d 
| a> ¥OV2K. (6.6) 


Since x” and its derivative 2x are continuous, the equation certainly has a unique 
solution that exists in some open interval containing t = 0. We will see how to 
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derive the solution of this equation in Chapter 8. For now observe that 


1 
x(t) = >— (6.7) 
xj -t 


satisfies the equation (provided that x9 4 0): clearly when t = 0 we have x(0) = 
xo, and differentiating gives 


2 
Seige) = | )anor 
t 


so that the equation is satisfied. Since we know that the solution of the equation is 
unique, we must have the solution. 

Our solution has some interesting properties. If x9 > O then the denominator 
is initially positive (at t = 0), but decreases as ¢ increases until it reaches zero at 
time t = Xo ' This means that the solution, x(t), has become infinite by the time 
t=Xo 1 we say that it ‘blows up’ in a finite time. 

Things are much nicer, though, if we want to see where our solution came from 
in the past. We can decrease t (from zero) as much as we like, since as ¢ de- 
creases the denominator becomes larger, and so the solution itself tends to zero 
as t — —oo. So when xp > O we can define the solution of (6.6) on the interval 
(—00, Xo 1 but there is no way to define the solution on an interval that extends 
further into the future beyond the time t = x9 ! We refer to (—00, Xq. I) as the 
maximal interval of existence for (6.6). 

Note that Figure 6.4 also shows that solutions with x9 < 0 tend to —oo as t 
decreases towards a finite t* < 0. When xg < 0 the maximal interval of existence 
is (Xp - +o), and only for x9 = 0 can we define a solution for all t € R (and then 
the solution is x(t) = 0). 

The two ill-behaved equations in this chapter (x = x!/* and x = x7) should 
serve as cautionary examples as to the limitations of the existence and uniqueness 
theorem. That said, almost all the examples we meet in what follows will have 
unique solutions that exist at least for all t > 0. 


6.4 The Clay Mathematics Institute’s $1 000 000 question 


The questions of existence and uniqueness are somewhat dry, but there are still ex- 
tremely important mathematical models for which these issues are not resolved. 
Outstandingly, it is still not known whether the Navier-Stokes equations that 
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Fig. 6.4. For positive initial conditions the solutions of x = x* blow up ina finite 
time, but exist for all negative values of t; while for negative initial conditions the 
solutions blow up for a finite value of t < 0 but exist for all t > 0. 


model the flow of fluids, 


p| + (a- Vou] want Vp =F V-u=0 
(cf. (3.3)), have unique solutions that exist for all positive times. 

These equations are the basis of computational design of everything that in- 
volves fluid flow; given that the term ‘fluid’ includes both liquids (in particular 
water) and gases (in particular air), numerical methods based on these equations 
are extremely important commercially. Clearly given the financial investment in- 
volved, people are confident that these equations really can predict the behaviour 
of physical systems, but currently we have no guarantee. Most tellingly, you can- 
not prove that a numerical approximation is ‘close’ to the ‘true solution’ if you do 
not even know that such a solution exists. 

For the year 2000 the Clay Mathematics Institute, based in America, announced 
seven Millennium Prize Problems; for the solution of any of these they will award 
a prize of one million dollars.? One of these problems is to determine whether 
or not the three-dimensional Navier-Stokes equations are indeed a good physical 
model, i.e. whether or not they have unique solutions valid for all positive times. 
There are of course, two ways to win this prize: either to invent some insightful 
new mathematics that will prove the existence of unique solutions; or to dream up 
a single initial condition for which the solution breaks down. 


2 See www. claymath.org/index.htm 
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Exercises 


Which of the following differential equations have unique solutions (at least on some 
small time interval) for any non-negative initial condition (x(0) > 0)? 

(i) x =x(1 — x?) 

(ii) x =x 
(ili) x = 
(iv) x =x'/2(014+ x) 

(v) x= (14 x)3/2, 
(T) The Mean Value Theorem says that if f is differentiable on an interval [a, b] then 
f(a) — f(b) = (6 —a)f'(c) for some c € (a, b). Suppose that f(x) is differentiable 
with | f’(x)| < L for a < x < b. Use the Mean Value Theorem to show that for a < 
x,y <b we have 


1/3 


| 
be 


|f(x) — fQy)| < Llx — yl. 


(T) This exercise gives a simple proof of the uniqueness of solutions of 
x= f(x,t) — x(to) = x0, (E6.1) 
under the assumption that 
lf@,t)— fy, Ol < Llx — yl. (E6.2) 


Suppose that x(t) and y(t) are two solutions of (E6.1). Write down the differential 
equation satisfied by z(t) = x(t) — y(t), and hence show that 


d 
ag: = 221 fa),1) — fOM, oO). 


Now use (E6.2) to show that 
di 2 2 
— < 2L\|z\*. 
ql S 2ele| 


If dZ/dt < cZ it follows that Z(t) < Z(to)e°“—™ (see Exercise 9.7): use this to de- 
duce that the solution of (E6.1) is unique. Hint: any two solutions of (E6.1) agree when 
Lh. 


(T) The proof of existence of solutions is much more involved than the proof of their 
uniqueness. We will consider here the slightly simpler case 
x= f(x) with x(0) = xo, (E6.3) 
assuming that 
If) — f)| s Llx — yl. (E6.4) 


The first step is to convert the differential equation into an integral equation that is 
easier to deal with: we integrate both sides of (E6.3) between times 0 and ¢ to give 


t 
x(t) = xo+ [ f (x(i)) di. (E6.5) 
0 
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This integral equation is equivalent to the original differential equation; any solution 
of (E6.5) will solve (E6.3), and vice versa. 

The idea behind the method is to use the right-hand side of (E6.5) as a means of 
refining any ‘guess’ of the solution x,(¢) by replacing it with 


t 
Xngi(t) = x04 [ f (p(i)) dF, (E6.6) 


We start with xo(t) = xo for all f, set 


t 
PORTE TE [ F xo) df, 


and continue in this way using (E6.6). The hope is that x,,(t) will converge to the 
solution of the differential equation as n — oo. 
(i) Use (E6.4) to show that 


t 
IXn4i(t) — Xn) < uf xn (t) — Xn—1(0)| df, 
0 
and deduce that 


1 
max |Xn4101) — Xn()| S 5 max [xp (t) — Xn-1(0)I- (E6.7) 
té[0,1/2L] 2 re[0,1/2L] 


(ii) Using (E6.7) show that 


1 
{) = t t)- t 
bint — XO S symp mar, be) — x00. 


re[0. 1/22] 

(ii) By writing 
Xn(t) = [Xn (t) — Xn-1)] + [en-1) — Xn-2(9)] 

+--+ + [x1(t) — xo(t)] + xo(t) 


deduce that 


1 
lan) — XmO] S syry ,_ man |r) — x0) (E6.8) 


eo. TPL] te[0,1/2L] 


foralln,m > N. 
It follows that x, (t) converges to some function Xoo (t) as m —> oo, and therefore taking 
limits in both sides of (E6.6) implies that 


t 
Ka S53 / Foo) a. 
0 


Thus xo0(f) satisfies (E6.5), and so is a solution of the differential equation. The pre- 
vious exercise shows that this solution is unique. 


ei 


Scalar autonomous ODEs 


For the most part when considering first order equations we will concentrate on 
finding explicit solutions. However, in this chapter we will see how, for the partic- 
ular class of equations of the form 


d 


Xx 
al 


we can understand the solutions ‘qualitatively’, even if we cannot (or do not) write 
down their solutions explicitly. 

What this means is that instead of writing ‘x (t) = something’ we describe how 
the solutions behave, e.g. ‘any solution starting with x(0) between zero and one 
tends to x = 1 as f — oo’ or ‘the point x = —1 is stable’. There is a very simple 
way to represent all this information about solutions pictorially, and the method 
essentially reduces to sketching the graph of the function f (in fact we only need 
to know where / is positive and negative). Nevertheless, the qualitative results we 
will obtain are completely rigorous. 


7.1 The qualitative approach 


The key observation is that the existence and uniqueness result of Theorem 6.2 
tells us that, provided f is ‘nice’, a solution of 


dx 
rae) (7.1) 
is completely determined by its value at any time ¢. The equation itself can then be 
used to determine whether x(t) is increasing or decreasing, depending on the sign 
of f. 
The easiest way to think of this kind of equation is to imagine that x(t) repre- 
sents the position of a particle moving on a line at time ¢. We can then talk about the 
‘velocity of the particle’ rather than the more cumbersome ‘rate of change of x’. 
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Fig. 7.1. A sketch of the function f against x, and the phase diagram for the 
equation x = f(x). On the phase diagram the stationary points are represented as 
crosses and the arrows indicate whether the solution is increasing or decreasing. 


Whatever the equation really represents, we can use the particle idea while solv- 
ing it, and then reinterpret our results for the original application when we have 
finished. 

In order to understand how solutions behave we first find all the values of x at 
which the particle does not move; this happens at the points x* where f(x*) = 0. 
As is often the case with fundamental ideas, such points have many names; we will 
call them stationary points. 

In regions where f(x) > 0 the solution x(t) is increasing, and so the particle 
is moving to the right; similarly wherever f(x) < 0 the solution x(t) is decreas- 
ing, and the particle is moving to the left. Note that the particle cannot reverse 
the direction in which it is moving; if it were to do this then at some time t* 
it would have to be instantaneously at rest, so that x(¢*) = f(x(t*)) = 0. But 
then x(t*) would be a stationary point, and so the particle would not be able to 
move. 

The simplest way to present all this information is to draw a line representing the 
x coordinate. Stationary points are usually indicated by a cross (x); we then draw 
arrows on the line indicating the direction in which x(t) is changing: if f(x) > 0 
then the particle will move to the right and if f(x) < 0 then the particle will move 
to the left. If we sketch the graph of f then it is easy to see the regions in which f 
is positive and negative. An example is shown in Figure 7.1. 

The picture of the line, with the stationary points and the direction of travel of 
the solution indicated, is known as the ‘phase diagram’ or ‘phase portrait’, which 
is shown on its own in Figure 7.2. (Figure 7.1 has two components; the phase 
diagram, and the sketch of f that makes it easier to draw.) With this kind of picture 


! Other common terms are equilibrium points, fixed points and critical points. ‘Equilibrium point’ has a more 
physical flavour than the general tone of this book, and we will reserve the term ‘fixed point’ for use with 
iterated maps in Chapter 23 (a fixed point will be a point for which x* = f(x*)). We will use the term ‘critical 
point’ for a point at which a function F(x) has all its partial derivatives zero, see Appendix C. 
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> <q < > <{ 
Fig. 7.2. The phase diagram from Figure 7.1. 


qualitative behaviour of the solutions at a glance, even when we cannot write down 
the solutions explicitly. 


7.2 Stability, instability and bifurcation 


Looking at the phase diagram for the above example, we can see that some sta- 
tionary points are ‘attracting’ (nearby solutions approach), while some appear to 
be ‘repelling’ (nearby solutions move away). These ideas can be made mathemat- 
ically precise and are extremely important in applications. 

A stationary point is stable if when you start close enough to it you stay close 
to it. More precisely, a stationary point x* is stable if given any € > 0 there exists 
a é > O such that 


Ixo —x*| <6 > Ix(t)—x*|<e forall ¢>0. (7.2) 
EE a _eSSSs)85 898 897 
start close enough stay close 


The stationary points in the example above have the stronger property of being 
attracting, which means that if you start close enough you actually tend to the 
stationary point: there exists a 6 > O such that 


Ixo —x*| <6 => x(t) > x* as t>+00. (7.3) 
| ——— 
start close enough tend to 


In one-dimensional systems attracting points are stable (see Exercise 7.9) but 
this is not true in general. Conversely, an example with many stable stationary 
points that are not attracting is 


—Xx x <0 
dx 
—= 0 O0<x<l 
dt 

1-x x >. 


Here, all the points in the interval [0,1] are stationary points, and they are all 
stable. However, none of them are attracting, since there are nearby points that 
move no closer. The phase diagram is shown in Figure 7.3. 

A stationary point is unstable if it is not stable; this means that no matter how 
close (5) you start you will always move some stationary distance (€) away: there 
exists an € > O such that whatever 5 > O you take there is a point with |xg — x*| 
< 6 but |x(t) — x*| > € for some t > 0. 

The stationary points from Figure 7.2 are labelled according to their stability or 
instability in Figure 7.4. 


7.3 Analytic conditions for stability and instability 49 


-0.8- | 


+1 Ll L L L Ll 
-1 -0.5 0 0.5 1 15. 2 


Fig. 7.3. The thick line consists entirely of stationary points, all of which are 
stable but none of which are attracting. 


S U U S 


Fig. 7.4. The phase diagram from Figure 7.2, but with the stationary points la- 
belled according to their stability type, S for stable and U for unstable. 


Fig. 7.5. The stability or instability of a stationary point x* can be determined 
from the value of f’(x*) provided that f’(x*) 4 0. If f’(x*) < 0 then the sta- 
tionary point is stable, and if f’(x*) > 0 the stationary point is unstable. 


7.3 Analytic conditions for stability and instability 


There are very simple conditions on the derivative of f which will let us know 
whether a stationary point x* is stable or unstable without having to sketch the 
graph of f. 

If the graph of f near x* looks as shown in the left-hand side of Figure 7.5, 
i.e. if f’(x*) <0, then the point will be stable, while if the graph of f looks 
like the right-hand side, i.e. if f’(x*) > 0, then the point will be unstable. Only 
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stable unstable/semi-stable 
unstable/semi-stable unstable 


Pe ga 


Fig. 7.6. The various stability possibilities when f’(x*) = 0. 


when f’(x*) = 0 is there any ambiguity; there are four possibilities, pictured in 
Figure 7.6. The top right and bottom left cases are sometimes called ‘semi-stable’, 
since the stationary point is stable ‘on one side’ and unstable on the other. 


7.4 Structural stability and bifurcations 


Observe that if f’(x*) 4 0 then making small changes to the function f will not 
have a significant effect on the graph of f near x*. (The easiest ‘small change’ to 
imagine is adding or subtracting a constant, which will pull the graph of f up or 
down.) After the change there will still be a stationary point close to x* with the 
same stability properties (see Exercise 7.11). 

However, if f’(x*) = 0 then we can make small changes to f(x) and drasti- 
cally affect the ‘picture’ near x*. For example, in the top right case of Figure 7.6, 
increasing f(x) by any constant c > 0 will mean we no longer have a stationary 
point. 

When we make a small change to f but the phase diagram changes drastically 
we say that the equation has undergone a bifurcation. In these simple examples, we 
cannot have a bifurcation near x* unless f’(x*) = 0. When small changes to f (x) 
cannot effect the qualitative nature of the phase diagram we say that x = f(x) is 
structurally stable. 

We will look at a particular example of a bifurcation in Section 7.6; watch for a 
stationary point with f’(x*) = 0. 


7.5 Some examples 


We now consider various examples using this graphical method. 
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Fig. 7.7. The phase diagram for the population model (7.4). Note that this picture 
includes solutions with p < 0, which, while mathematically sensible, are irrele- 
vant for this application. 


7.5.1 A population model 


The equation 


dp _ P 
—=kp(l1-— with k,M>0 (7.4) 
dt M 
is a model for the change in the size of a population.” We will study this model 
in more detail in the next chapter, but for now we try to understand its qualitative 
behaviour. 
The first step is to find the stationary points. These occur where the right-hand 


side is zero, i.e. when 
DP 
kp (1— 3) =9 

so they are p = 0 and p = M. If we sketch the graph of f(p) = kp( — (p/M)) 
then it is easy to draw the phase diagram, remembering that solutions move to the 
right whenever f > 0 and to the left whenever f < 0. The phase diagram is shown 
in Figure 7.7. It is easy to see from the diagram that provided that we start with a 
positive population then it will eventually settle down to the value at the stationary 
point p = M; smaller populations will tend to increase, while larger populations 
will shrink towards this value. 

We can check to stability of the stationary points analytically by looking at the 
derivative of f, 


f'(p) =k — (2kp/M). 


At the origin f’(0) =k > 0, and the origin is unstable (as we expected), and 
f'(M) = —k < 0, confirming that the stationary point at p = M is stable. 

Note that this kind of solution can tell us what happens eventually, and how 
the population changes qualitatively. But it only allows us to make a very limited 


2 There are some very reasonable objections to this model; in particular, unlike the number of people in a popu- 
lation, the variable p does not have to be an integer. We can get round this to some extent by claiming that p is 
‘the population in millions’, and then p can be a decimal. There are still values that do not correspond to whole 
numbers of people, but the equation will now be a good approximation. 
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Fig. 7.8. The phase diagram (on the right) reflects the behaviour of solutions (a 
collection of which are shown on the left). 


number of quantitative predictions. For example, if we know that equation (7.4) is 
the right model for our population, but do not know the values of k and M, there 
is no way that we can use our phase diagram to find k and M given a collection 
of data. However, this is possible using the explicit solution, as we will see in the 
next chapter. 

Figure 7.8, shows how the phase diagram (rotated to be vertical in the figure) 
reflects the behaviour of the solutions, a collection of which are plotted against f. 


7.5.2 Terminal velocity 


Sometimes we do not need an explicit solution to find the quantitative information 
we require. Here we will use the phase diagram to find the terminal velocity of a 
falling object. 

Suppose that a body of mass m is falling under gravity g and is subject to an 
air resistance proportional to the square of its velocity, kv. The equation for the 
downward velocity v is 


ae ku|v| (7.5 
ma me kal, ) 


since gravity serves to accelerate the particle, and the air resistance acts in the 
opposite direction to v. 

Provided that the particle is moving downwards, so that v > 0, equation (7.5) 
becomes 


du 
— = mg — kv’. 
Le mg v 
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(mak) 


Fig. 7.9. You can simply read off the terminal velocity v = /mg/k. 


We can rewrite this as 


dv k 5 
Ap ee ee 


For this equation there is only one stationary point, when 
g——v' =0, 


i.e. when v = v* = ./mg/k. This point is stable, as can be seen by looking at 
the derivative of f at v*: since f’(v) = —2kv/m we have f'(v*) = —2./gk/m 
< 0. 

The phase diagram is shown in Figure 7.9; it is clear that there is an attract- 
ing stationary point at v = ./mg/k. This is the terminal velocity, since v(t) ap- 
proaches this value whatever the initial condition. 

For a skydiver of mass 100 kg in freefall, we can take k © 1/3 kg/m and g © 
9.8 m/s?. It follows that the terminal velocity of the skydiver is 


v = V 100 x 9.8 x 3 © 54.2 m/s. 


7.5.3 What have we lost? 


We will now see that we are missing some, at times vital, information if we only 
rely on the phase diagram. The phase diagrams for the two equations x = |x| and 
x = x? are shown in Figure 7.10: although the equations are different their phase 
diagrams are the same. 

In the next chapter we will see how to calculate the solutions of both of these 
equations. For now we will assume that we know what these solutions are; for an 
initial condition x(O) = xg > 0, the solution of x = |x| is 


x(t) = xoe’, 
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Fig. 7.10. Different equations can have the same phase diagram; on the left is the 
phase diagram for x = |x|, and on the right the phase diagram for x = x7. 


while the solution of x = x? is 


1 
x) = ——, 
xy —t 
(cf. (6.7)). 

The first solution increases to infinity, but is defined for all t > O (in fact for 
all t € IR). However, we have already used the equation x = x7 to show that it is 
possible for solutions to blow up in finite time (x(t) > coast > Xo a This finite 
time blowup behaviour is not captured in any way by our phase diagram. 

So although the phase diagram gives the correct qualitative behaviour, we have 
lost all information on the rates at which things happen. When the equation exhibits 
blow up of solutions in a finite time, this is particularly unfortunate. 


7.6 The pitchfork bifurcation 


We now consider the equation 
x = x(k —x°), (7.6) 


where k is a parameter. By varying k we can study a whole family of differen- 
tial equations. We will see that the qualitative behaviour of the solutions of (7.6) 
changes drastically as k passes through zero. 

When k <0 there is only one stationary point, that at x = 0. If we write 
f(x) = x(k — x”) then f’(x) = k — 3x”, and at the origin we have f’(0) =k. It 
follows that for k < 0 this stationary point is stable. The phase diagram is shown in 
Figure 7.11, together with the graph of f/f. 

When k = 0 there is still only the one stationary point at x = 0, although now 
f'(O) = 0. In order to determine the stability of the origin when k = 0 we have to 
sketch the graph of f(x) = —x?. It is then clear that the origin is still stable, and 
that the phase diagram is the same for k = 0 as it was for k < 0, see Figure 7.12. 

Since f’(0) =0 when k = 0, there is the possibility of a bifurcation as k 
changes from negative to positive (see Section 7.4). When k > 0 there are two 
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Fig. 7.11. The phase diagram when k < 0, and the graph of f. 


Fig. 7.12. The phase diagram when k = 0, and the graph of f(x) = —x?. 


Fig. 7.13. The phase diagram when k > 0, and the graph of /f. 


new stationary points at x = +/k. While the origin is no longer stable, since 
f'(0) =k > O, the new fixed points are both stable, since f/(4Vk) = —2k < 0. 
The phase diagram for this case is that shown in Figure 7.13. 

You can see that the phase diagram has changed drastically as k has gone from 
being negative to positive. We have gone from having one stable stationary point 
for k < 0 to having three stationary points when k > 0; two of these are stable, 
and the origin has become unstable. 

We can draw a ‘bifurcation diagram’ to show these changes. The idea is to draw 
a graph where the horizontal axis represents the parameter k, and for each value 
of k we plot the location of the stationary points on the vertical axis, using a solid 
line when they are stable and a dashed line when they are unstable. This gives 
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Fig. 7.14. The pitchfork bifurcation; stationary points are plotted against k. Solid 
lines indicate stable points, and the dashed line an unstable point. 


the picture in Figure 7.14. For fairly obvious reasons this is known as a ‘pitchfork 
bifurcation’. 


7.7 Dynamical systems 


The qualitative approach we have adopted here is the main viewpoint used in the 
general theory of dynamical systems. A dynamical system has two components: the 
phase space (or ‘state space’), which consists of all possible ‘states’ of the system 
(for the scalar equations of this chapter this is the line R, covering all possible 
values of x), and the ‘dynamics’ which describe how these states change in time 
(for us the dynamics were determined by the solutions of the differential equation 
x = f(x)). 

With the advent of more powerful computers there have been major advances in 
the theory of dynamical systems in recent years, and the subject received a lot of 
attention in the 1980s under the media-friendly “chaos’ banner. We will see more 
examples of dynamical systems later in the book. 


Exercises 


7.1 For each of the following differential equations draw the phase diagram, labelling the 
stationary points as stable or unstable. 
Gi) x =-x+1 
(ii) x = x(2—x) 
(iii) x = (1+ x)(2 — x) sinx 
(iv) x = —x(1 — x)(2— x) 


(v) xX =x? —x4 


7.2 


73 


74 


7. 


Nn 


7.6 


7.7 
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For the equations in Exercise 7.1 determine the stability of the stationary points ana- 
lytically, by considering the sign of the derivative of the right-hand side. 
For all positive values of c find all the stationary points of 


dx valet 
— =sin C, 
dt . 


and determine analytically which are stable and unstable. Draw the portion of the 
phase diagram between —7 and 7. There are three different cases,0 <c < 1,c=1, 
and c > 1. You will need to be more careful with the case c = 1. 

A simple model of the spread of an infection in a population is 


H =-—-kIH 
IT=klIH, 


where H(t) is the number of healthy people, /(t) the number of infected people 
and k the rate of infection. Since (d/dt)(H + J) = 0, it follows that the size of the 
population is constant, H + I = N, say. Substitute J = N — H in order to obtain a 
single equation for H(t), 


dH 
= —-kH(N — BA). 
dt 


Determine the stability of the stationary points for this equation, and draw its phase 
diagram. Deduce that eventually all the population becomes infected. 
Consider the equation 


dx _ age 
Gon Ks 


Draw the phase diagram for the three cases k < 0, k = 0 and k > 0, labelling the 
stationary points as stable or unstable in each case. Find the stability of the stationary 
points using an analytic method when k > 0. Show that f’(0) = 0 when k = 0. Why 
is this significant? 

Draw the bifurcation diagram, with k on the horizontal axis and the fixed points 
plotted against k, indicating stable fixed points by a solid line and unstable fixed 
points by a dashed line. (This is known as a saddle node bifurcation.) 

Draw the phase diagram for the equation 


& = g(x) =kx — x? 


fork < 0,k = Oandk > 0. Check the stability of the stationary points by considering 
g’(x), and show that the two stationary points exchange stability as k passes through 
zero. Draw the bifurcation diagram for this transcritical bifurcation. 

One equation can exhibit a number of bifurcations. Find, depending on the values of 
k, all the stationary points of the equation 


x =h(x) =—-(1 +x)? —k) 
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and by considering h’(x) determine their stability. At which points, and for which 
values of k, are there possible bifurcations? 

Draw representative phase diagrams for the five distinct parameter ranges k < 0, 
k=0,0<k <1,k =1andk > 1, and then draw the bifurcation diagram. Identify 
the type of the two bifurcations. 


In the remaining exercises assume that f is a C! function, i.e. that both f and df/dx are 
continuous functions. Note that such an f is smooth enough to guarantee that the equation 
x = f(x) with x(to) = xo has a unique solution. You may also assume that the solutions 
are defined for all t > 0. 

7.8 (T) Let x(t) be one solution of the differential equation 


T 


Ne} 


ee 


% = f(x). 


Show that 

(i) if f(x(t*)) = 0 for some ¢t* then x(t) = x(t*) for all t € R (the solution is con- 
stant, and x(t*) is a stationary point); and hence 

(ii) if f(x(t*)) > 0 for some f* then f(x(t)) > 0 for all t € R (the solution can- 
not ‘reverse direction’). Hint: Use the Intermediate Value Theorem: if g is 
a continuous function with g(a) <0 and g(b) > 0 then there is a point c 
between a and b with g(c) = 0. 

Of course, a similar result to (ii) holds if f(x(t*)) < 0 for some t*. 

(T) Show that for autonomous scalar equations, if x* is attracting then it must also be 

stable. Hint: use (11) above. 

(T) Suppose that x(t) is a solution of x = f(x) that is moving to the right. Show that 

either x(t) — +-oo, or x(t) > x*, where x* is a stationary point. (Hint: If x(t) does 

not tend to infinity then it is increasing and bounded above, and so tends to a limit 

x*, Show that in this case we must have f(x*) = 0.) A similar result holds if x(t) is 

moving to the left, with +-co replaced by —oo. 

(T) Suppose that x = f(x) has a stable stationary point at x9, with f(xo) < 0. Let 

g be another C! function. Use the following scalar version of the Implicit Function 

Theorem to show that for € sufficiently small the equation 


x= f(x) + €g(x) 
has a unique stationary point near x9 which is still stable. 


Theorem. Suppose that h(x, €), dh/dx, dh/de are all continuous functions of both 
x and €. Suppose also that h(xo, 0) = 0 and dh/0x (xg, 0) 4 0. Then there is an open 
interval 7 that contains xo such that for each € sufficiently small there is a unique 
solution y(e€) € J of 


h(y(e),€) =9, 


and y(€) depends continuously on e. 


8 


Separable equations 


We now begin our survey of the various different classes of equations that we can 
solve explicitly. Both the ‘trivial’ equations 


dx 
Pr = f(t) 


of Chapter 5 and the autonomous equations 


F=f) 


of the previous chapter are particular cases of the separable equation 


dx 81 
ae f(x)g(t) (8.1) 


which we study in this chapter. 


8.1 The solution ‘recipe’ 


If you have already seen these equations, then you will probably be used to solving 
them in the following way. If these equations are new to you, take careful note; this 
is the practical way of finding a solution. However, there are steps here that should 
make you uneasy. 
We start with the equation 

dx 

—= t). 

a = f@)8O 
Now divide by f(x) and ‘multiply up by dz’ to obtain 


1 
7) dx = g(t) dt. 


59 


60 8 Separable equations 


This is “separating the variables’, since we now have all the xs on one side and all 
the fs on the other. For the general solution we integrate both sides to get 


1 


Alternatively, if we want to take into account an initial condition x (to) = xo then 
we integrate between the limits that correspond to times to and t: for the left-hand 
side these are x(fo) and x(t), while on the right-hand side they are just fo and tf. 
This gives 


eee sates 8.3 
[ aa va | ali) df. (8.3) 


We now use this recipe to find the solution of the equation we used in Chapter 6 
to show that the solutions of a differential equation can blow up in a finite amount 
of time (i.e. x(f) > +00 as t > t* < oo). At the time we had no method for 
solving this equation, and just wrote down the solution, but now we can use the 
separation method to find it for ourselves. 


Example 8.1 Find the solution of the initial value problem 
dx 2 

—— = 0) = 7 

a x(0) = xo 
If xo = O then x(t) = 0 for all t. Otherwise we can separate the variables to give 


a dt 
—dx = dt. 
ye 


Integrating between limits corresponding to times 0 and f, 


x(t) 1 t 7 
/ 2 dx = - dt, 
x0 x 0 


we obtain 
1 x(t) 
fe 
X Ix=xo 
Therefore 
1 7 1 _ 
x(t) xo | 
which simplifies to give 
1 
x(t) = = ark, 
xy —t 


as we claimed before (cf. (6.7)). 
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8.2 The linear equation x = Ax 
We now find the solution of the simplest possible linear differential equation, 


dx 
dt 
with the initial condition x(f9) = xo. 
This example is absolutely fundamental (the reasons for this will become appar- 
ent later) and you should really only have to solve this equation ‘long-hand’ once 
or twice before you are happy to write down the solution with no calculation. 
First note that if x9 = 0 then x(t) = 0 for all t. Otherwise, if x 4 0 then we can 
divide by x and ‘multiply up by d?’ to give 
dx 


—=A)dt. 
x 


= Ax (8.4) 


We now integrate both sides between the limits corresponding to the times fo 
and t; that is, xo and x(t) on the left, and ¢o and ¢ on the right; and get 


x(t) dx t 7 
/ o i adi, 
x0 x ut) 


x(t) 
fini | SE oRy 


X=X0 


which gives! 


So we have 
In |x(t)| — In |xo| = A(t — fo), 
and taking exponentials (e to the power) of both sides gives 


|x (t)| = eh(t—to). 
xo| 


To work out what to do about the modulus signs, the easiest thing is to draw the 
phase diagram. For the case A > 0 this is shown in Figure 8.1, from which we can 
see that x(t) and xo have the same sign. It follows that we can remove the modulus 
signs and multiply up to give 


x(t) = xp), 


¥ Many of the integrals in this chapter will involve logarithms, and the annoying modulus signs that come with 
them. We will have to take some care to work out how to remove them for our final answers; the most useful 
method is to use the phase diagram, as the examples show. 
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Fig. 8.1. The phase diagram for x = Ax when A > 0 (taking A < 0 would reverse 
the direction of the arrows). 


Note that the general solution of (8.4) is 
x(t) = Ae”, 


see also Exercise 8.5. 


8.2.1 Exponential decay and exponential growth 


We looked at the solution of equation (8.4) with A < 0 in Chapter 1, and applied 
it to the example of radioactive decay. We saw that the solutions decay to zero 
exponentially fast, and that the rate of decay could be characterised by the half- 
life; the solution halves in a fixed time. 

When A > 0 the solutions tend to infinity as tf + oo, and increase exponentially 
fast. In this case the size of the solution will double after a fixed time, given by fo, 
where 


2= er! 


Le. f2 = 1n2/X. In the following section we look at the use of this linear equation 
as a simple population model. 


8.3 Malthus’ population model 


The simple linear equation 


dp _ 

a 
was proposed in 1798 by the English economist Thomas Malthus as a basic model 
for population growth. Here the increase in the population is taken to be propor- 
tional to the total number of people, and k is a constant representing the rate of 
growth (the difference between the birthrate and the deathrate). This model pre- 
dicts exponential growth of the population, 


kp with k>0 (8.5) 


p(t) = p(to)ek"—, 
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so that its size grows without bound and will double every d years, where d = 
In2/k. This led Malthus to see war and famine as a possibly desirable check on 
this otherwise disastrous population explosion. 

We will now compare the predictions of this model with census data gathered 
over the last two hundred years. The population of Great Britain and Ireland in 
1801, 1851 and 1901 can be found in the results of the Census for each of those 
years: 


year population 
1801 16345 646 
1851. 27533755 
1901 41609091 


(8.6) 


We can use the data from 1801 and 1851 to estimate k. Our solution predicts 
p(1851) = p(801)e™, 


and so 

_ In p(1851) — In p(1801) 

7 50 

This implies that the population will double roughly every 69 years (In2/k ~ 69). 
Using this value of k, our solution, illustrated in Figure 8.2, gives a reasonable 

prediction for the population in 1901: 


k = 0.010. 


p(1901) = p(1801)e!0% ~ 46 million. 


Population (millions) 
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Fig. 8.2. The UK population as predicted by Malthus’ linear model. The census 
values for 1801, 1851, 1901 and 2001 are indicated by crosses. 
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However, it vastly overestimates the population in 2001 as 
p(2001) = p(1801)e ~ 131 million, 


whereas the 2001 census found just below 59 million (in fact” the figure is the 
delightfully precise 58 789 194). To be consistent we should include the figures 
for the Republic of Ireland, since the data in (8.6) dates from before the partition 
of Ireland in 1921. The census held there in 2002 found a population of around 
4 million.? So the total figure for 2001 should be approximately 63 million. 
Malthus’ model has predicted over twice this, so it turns out to be very unreli- 
able when we try to extrapolate the population very far into the future. We will 
soon see another model that gives much more realistic results. 


8.4 Justifying the method 


We now give a careful justification of the ‘recipe’ we outlined in Section 8.1. In 
particular, you should have worried about the idea of ‘multiplying up by dt’, since 
this kind of manipulation of infinitesimal quantities is extremely dubious. 

We start again with 


dx 

a I 8, (8.7) 
and assume that f (x) is sufficiently smooth to ensure that Theorem 6.2 guarantees 
the existence of a unique solution for any specified initial condition. 

First note that if x(t) is a solution of (8.7) with f(x(s)) = 0 for some s then in 
fact x(t) = x(s) for all t € R. This follows from the uniqueness of solutions; as- 
suming that x(t) = x(s) for all t implies that f(x(t)) = f(x(s)) = Oforallt € R, 
and so x(t) = 0 for all t, showing that this choice for x(t) solves the equation. 
Since solutions of the IVP are unique, this x(t) must be the only solution with the 
specified value of x(s). 

So either f (x(t)) = 0 for every value of t, or f(x(t)) 4 0 for every value of t. 


We now treat the case f(x(t)) 4 0 for all t, for which we can divide both sides of 
(8.7) by f(x) to give 

= 3) (8.8) 
fod. ° 


Now, suppose that H (x) is an anti-derivative of 1/f (x), ie. 


1 
H’ = ——, 
VO a) 


2 See www. statistics .gov.uk/census2001/default.asp 
3 The exact figure was 3 917 336, see www. cso. ie/census/prelimimary_details.html#pop. 
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Then observe that by the chain rule (see Appendix C) 


dx 1 dx 


d / 
qe» = H(x(t)) ip F@) de 


and so (8.8) can be rewritten as 
Hi (x(t)) = g(t) 
deer 
To find the solution we can integrate both sides with respect to ¢ to give 
A(x(t)) = / g(t) dt. (8.9) 
(To find x(t) explicitly we have to be able to invert H, i.e. solve the equation 
H (x) = z to obtain x in terms of z. In some cases the implicit form of (8.9) might 


be the best that we can do.) Since A is an anti-derivative of 1/f, we could write 
this symbolically as 


1 
ls ar= f ear (8.10) 


which is precisely what we had before as equation (8.2). 
We now see how to recover (8.3) (see equation (8.11) below). If G is an anti- 
derivative of g then (8.10) reads 


A(x(t)) = Git) +c, 
and so when we want to take into account an initial condition x (to) = x9 we need 
H (x0) = G(to) + => c = H(x0) — Gt), 
and the solution is 
H (x(t)) — H(x0) = G(t) — G(to). 


Using the method of evaluating an integral by anti-derivatives (which is formalised 
as (5.3) in the FTC) we can rewrite this as 


ee d t) dt. 8.11 
‘a a r= | @ i, (8.11) 


which agrees with the result of our more heuristic derivation above (equa- 
tion (8.3)). 
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8.5 A more realistic population model 


We now return to population modelling, but rather than allowing the unbounded 
exponential growth that resulted from Malthus’ model 


dp 
a 
we impose a maximum sustainable size for the population. The idea is that any 
species (including ours) is limited by the availability of natural resources. We will 
find that this new model gives a much better estimate of the current population, 
even extrapolated from the century-old data we used above. 
The so-called ‘logistic equation’ is 


P — kp (1 = =). (8.12) 


Interpreted as a population model, k is the growth rate of small populations; when 
p is small, p? is very small, so the equation is approximately dp/dt = kp, the 
model we had previously. The parameter M > 0 is the maximum sustainable pop- 
ulation; when p < M the population increases, and when p > M the population 
decreases. We drew the phase diagram for this equation in the previous chapter as 
Figure 7.7, and it will be useful to recall it now (see Figure 8.3) for use below. The 
phase diagram predicts that eventually the population will settle to its maximum 
sustainable level, p = M. 
We now solve the equation explicitly. Separating the variables gives 


kp 


——_—__dp = dr, 
kp(M — p) 


where we have multiplied top and bottom of the left-hand side by M. Using the 
method of partial fractions on the left-hand side this becomes 


:|=+ Jer=ar 
kL_Lp M-—p 
or 
|= + : | ap = kar (8.13) 
po Mp 
x > S< <q 
0 M 


Fig. 8.3. The phase diagram for the population model (8.12). In line with the 
interpretation of p as the size of a population, only the values p > O are shown. 
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Since 


1 1 
[ot per = nel nit - 


We can integrate both sides of (8.13) between the limits corresponding to times fo 


and f, 
i 1 1 ft 
—+ dp =) k dt, 
P(to) P M—p i) 


p(t) Ba (3 
fn ipl mim ~ pr | =|] ‘ 
P=P(to) t=I0 


Putting in the limits of integration, 


to give 


In p(t) — In|M — p(t)| — In p(to) + In|M — p(to)| = k(t — to) 
(since p(t) > O we do not need the modulus sign on In | p(t)|). Equivalently this is 
oe — p(to)| 
|M — p(t)|p(to) 


From the phase diagram in Figure 8.3 it is clear that if p(t9) < M then p(t) < 
M for all t, and similarly if p(to) > M then p(t) > M for all t. So the sign of 
M — p(t) does not change for each solution. It follows that we can remove the 


=KE=%): 


modulus signs, and then exponentiating both sides we obtain 
POM — PO)) _ run) 
(M — p(t))p(to) 


Finally, rearranging this gives 


pit)=M p(to)ek'—0) | | 


M — p(to) + p(toyek@—t0) 

With some thought we can read from this explicit solution the same qualitative 
behaviour we see in the phase diagram. In particular, since e 
we can deduce once again that p(t) — M ast > oo. 

Since we have an explicit solution we can now estimate the parameters M and 


> wast wo, 


k that occur in the equation using the census data quoted above in (8.6). Once 
we know M and k we can then see what the quantitative predictions of the model 
are for 2001. The calculations to find M and k are just simple algebra, but are 
not particularly instructive, so feel free to go straight to the values of M and k in 
equation (8.14). 
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Setting a = e"*, py = p(1801), py = p(1851), and p2 = p(1901), our solu- 
tion requires 


Mpoa 
A= 
M — po+ pow 
Mpoa? 
p2 = ————... 
M — po + pow 


(Since there are now two parameters in our equation we need all the data from 
(8.6) to estimate them.) Rearranging both equations to find M in terms of a and 
equating we have 


popi(@—1) _ pop2(a* — 1) 


Poe — pi pow? — pz 
and so 
P2(P1 — Po) Pi(2pop2 — Pip2 — PoP) 
SS and M — 5) < 
Po(p2 — P1) PoP2 — Pj 


Using the correct values for po, p; and p2 gives 
a=e* —20234 and M =83.1 million (8.14) 


which implies that k ~ 0.014, similar to the value (k ~ 0.010) we found for the 
simple exponential model (8.5). 

We can now use these values of k and M to predict the population in 2001. 
The value we obtain is 66.8 million, surprisingly close to the true figure (which, 
remember, is about 63 million); the solution is illustrated in Figure 8.4. Of course, 
there are good reasons for the discrepancy — among them two world wars and the 
invention of the contraceptive pill in the 1960s. 

Note that the constant M that arises in the model represents the maximum sus- 
tainable population; at 83 million this is still comfortably above its current level. 


8.6 Further examples 


We now treat some other examples. Note that often both the method of partial 
fractions and a quick sketch of the phase diagram are useful tools. 


8.6.1 Partial fractions again 


We drew the phase diagram for the equation x = x(k — x”) as part of our study of 
the pitchfork bifurcation in Section 7.6. Here we will consider the case k > 0, and 
so replace k by x, which will make the algebra that is to come a little simpler. We 
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Fig. 8.4. Graph of the population of the UK and the Republic of Ireland, as pre- 
dicted by the logistic model using the census data for 1801, 1851 and 1901. The 
curve is our theoretical prediction, and the crosses show the exact values (we have 
chosen our parameters to ensure that the first three crosses lie on this curve). The 
dashed line is the maximum sustainable population predicted by our model. 


will solve the equation 


with a general initial condition x(0) = xo. For the case k < 0 see Exercise 8.8. 
Separating the variables we have 


1 


— ~—- dx = dt. 
x(k? — x2) . 


We can use the method of partial fractions to rewrite the left-hand side as 


Lo I oi E eu 1 
x(k2—x2) x(k —x)\(K +x) K2 |x 2k—-—x) W2K+x)1/ 
So we have 


1 1 
x oD (WK +x) 


dx =x? dt. 


We can integrate this between the limits corresponding to times 0 and f to give 


x(t) t 
i: Mi ar / ade 
= } — K 
iy oe 206 =X). 20x) 0 
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—K 0 K 
Fig. 8.5. The phase diagram for * = x(k? — x). 


which is 


x(t) t 
[nist — bine — fine +a | = | 


X=X0 


|x| i‘ ‘ 
In —————_ = Kt. 
|x? — x?| Jr=xo 


Putting in the limits this becomes 


or 


Ix(t)|,\/ 1K? — x6 ; 
= Kt, 
Ixoly lk? — x(t)?| 


and exponentiating both sides we have 


t)|,/|K2 — xé 
Ix(t)|,/|K- — x6 et 


Ixol/le2 — x(02| 


Now if we square both sides and multiply up we have 


x(t)2|a2 — x2] = x2|a? — x(t)*[e2". (8.15) 


We drew the phase diagram for this example in the previous chapter (see 
Figure 7.13), and it is reproduced here as Figure 8.5. It is easy to see from the phase 
diagram that if x6 < «7 then x(t)? <x? for all ¢ and similarly if Xe > «> then 
x(t)? > k? for all t. So we can remove the modulus signs and rearrange (8.15) to 
give 


i0)- = 
K2 4 xp (e2e"t —1) 


or 


2 
= + | aa (8.16) 


107278 (2x07 — 1) 
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t 


Fig. 8.6. Solutions of x = x(k? — x”) on the left, and the corresponding phase 
diagram (rotated through 90 degrees) on the right. 


Whether we take the plus sign or the minus sign depends on the sign of the 
initial condition; for f = 0 we obtain 


x(0) = +y/x2 — +|xo| 


and we have to choose the sign so that x(0) = xo. 

From the phase diagram we can see that for any x9 > 0 the solution tends to «x 
as t —> oo, and for any xo < 0 the solution tends to —x; we can also recover this 
behaviour from our explicit solution, since if 0 < x9 < x then ent Cae a= 1) 
is always positive, and decreases from its initial value to zero as t > ov; it follows 
that x(t) increases from xo to x. Similarly if x9 > « then e271 (2 Bs 1) is 
always negative, and increases up to zero as t + ox, so that x(t) decreases to k as 
t > 00. 

You can also see from the explicit solution that if |xo| > « then the solution 
will blow up ast | t* < 0, since then the expression (K7x9 2 _ 1)is negative (see 
Exercise 8.9 for more details). 

Figure 8.6 shows the solutions of the equation, along with the corresponding 
phase diagram, rotated to illustrate how the behaviour of the solutions matches the 
predictions of the phase diagram. 


8.6.2 Two competing species 
Later we will look at some simple models of competing species, and will come 
across such equations as 
dy = yx — 2) 
dy w= 3y)- 
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Fig. 8.7. Curves on which In y + 2Inx — 3y — 5x is constant. 


This can be separated to give 
1-3 5x —2 
y i= x 
y Xx 


Integrating both sides we have 


[()o=f(-2)s 


taking x and y positive, because they represent the size of a population, we have 
no need of modulus signs in the logarithms arising from the integration, 


dx. 


Iny —3y =5x —2Inx+c. 


We can do no better than this implicit solution relating x and y. However, we can 
represent the curves defined by 


F(x, y) =Iny+2Inx — 3y — 5x = constant 


graphically, and these are shown in Figure 8.7. 


Exercises 


8.1 Solve the following equations: 
(i) x = P(1 — x) with x(0) = 3; 
(ii) y’ = (14+ y7) tanx with y(0) = 1; 
(iii) x = 12x (general solution); 
(iv) x = —x? (general solution); 


8.2 


8.3 


8.4 


8.5, 


8.6 


8.7 
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(v) for dy/dt =e" *y2 give the solution in terms of an integral and describe the 
behaviour of the solution as tf +00 depending on the initial condition y(0). 
You may assume that in eds = ft /2. 

Solve the linear equation 


x+px=q 


by separation of variables. 
Find the general solution of the equation 


xy’ = ky 
that is valid for x > 0. 
Find the function / (t) that satisfies 
dl oy 
ae 


(Your answer will involve an integral.) 
Use the method of separation of variables to show that the general solution of the 
linear equation 


x=dx 


is x(t) = Ae“ for any A ER. 

In Exercise 5.7 we showed, neglecting air resistance, that an apple falling from a 
height h reaches the ground when t = ./2h/g. If we include air resistance then pro- 
vided that v < 0 the equation becomes 


du 4 
; =-—mg+kv v(0) =0 


v(t) = — [= tanh (a) ; 


and hence that the apple now takes a time 
m= /— In Ca — Verkh/m — 1) 
& 


to reach the ground. Check that this coincides with the answer with no air resistance 
(t* = /2h/g) as k — 0. Hint: for small x, e* © 1+ x and In(1 + x) © x. 
Show that for k ~ 0 the solution of the differential equation 


with k > 0. Show that 


d 
ay ee 


dy with x(0) = xo 


k ek x9 


OSD TE 
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Using this explicit solution describe the behaviour of x(t) as t > oo for k < 0 and 
k > 0. (Note that this is much easier to do using the phase diagram than using the 
explicit form of the solution.) For k = 0 see part (iv) of Exercise 8.1. 
8.8 Show that the solution of the equation 
dx 


ae Se XP) 


with initial condition x(0) = xg is 


K2 
x(t)=+ , 
(1 + 2x5 *)e%7# -—1 


where the + is chosen according to the sign of the initial condition. Deduce that 


x(t) > Oast — oo. Ast decreases from zero the solution blows up as t approaches 
a finite value t* < 0. When is this ‘blow up time’? 
8.9 We found the solution of the equation x = pale - x) in Section 8.6.1, 


K2 
x(t) =+ : 
1+ e724 (K2x97 - 1) 


Show that if |xo| > |x| the solution blows up as ¢ decreases towards a finite negative 


value, and find this critical time. 
8.10 Consider the equation 


x =x” with x(0)>0 


for a > 0. Show that the only value of a for which the equation has solutions that 
are both unique and exist for all time is aw = 1. You should be able to find an initial 
condition for which the solutions are not unique when a < | (cf. (6.3)), and show 
that solutions with x(0) > 0 blow up in a finite time if a > 1 (cf. (6.6)). 

8.11 (T) Assuming that f(x) and f’(x) are continuous, show that if the solution of 


x = f(x) with x(0) = x0 


blows up to x = +00 in finite time then 


[fou <0. 
x0 


9 


First order linear equations and the integrating factor 


One type of first order equation that we can always solve, at least in theory, is a 
linear equation. The most general first-order linear equation (cf. (3.5)) is 
dx 
ay(t)—— + ao(t)x = f(t). 
dt 
However, we will concentrate on equations that are ‘always first order’, so we 
assume that a;(t) 4 0 and divide through by aj (t) to obtain 
dx 
ag + p(t)x = q(t). (9.1) 
t 
9.1 Constant coefficients 
First we will consider the simplest case, when both p and q are constants, 
fs pe = (9.2) 
qe ee ; 


There are a number of ways to solve this equation. We have already met one, 
the method of separation of variables; we could write the equation as 


dx 
q — px 
integrate both sides, and solve it this way (see Exercise 8.2). 
However, we are going to solve it by another method. Although this way in- 


= df, 


volves a trick, and may seem complicated in this simple case, it is also useful for 
the more general equation (9.1) where the coefficients do not have to be constants. 
The key point is to notice that 


d d d 
ae (x@e") = en + pxeP?! =e?! & + pe) (9.3) 


75 
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(using the product rule). The right-hand side of (9.3) is the same as the left-hand 
side of our differential equation (9.2), except that it is multiplied by a factor e”". If 
we multiply both sides of (9.2) by e?’ we have 


d 
ePt & + pe) =qer" 


and using (9.3) this is simply 
£ (wer) = qe?" (9.4) 
a : ; 
For the general solution we integrate both sides to give 
x(te" = Lert +¢, 
DP 


so that 


x@)a 2 Gen? (9.5) 
Dp 


(It follows that if p > O then x(t) — q/p as t — ov, independent of any initial 
condition.) 
If we want the solution that has x(a) = xg then we need 


fo Gere => C= (1-2 )er, 
P P 


and so this solution is 


eee (x. = 4) e Pa) (9.6) 
P P 


9.2 Integrating factors 
We now use the same sort of trick on the more general linear equation 
dx 
dt 


What we are doing is looking for an ‘integrating factor’ by which we can multiply 
both sides and so turn the left-hand side into something we can integrate easily. 
We will give a quick derivation of the form of the integrating factor that we need. 
However, in practice you should just write down the integrating factor; as we will 


now see it is 
exp (| p(t) tr) ; 


+ p(t)x =q(t). (9.7) 


ie. ef POA, 
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If we multiply both sides of (9.7) by a factor J (t) then we get 


dx 
BO +1M)pOx =1@)q@). 
Concentrate on the left-hand side, 
dx 
Nae +1) px; 


we want this to be the derivative of something. The first term is part of the deriva- 
tive of I (t)x(t), so we will see whether we can find a function / (t) such that 


“Mx = 1a + I(t) p(t)x(t), 
dt dt 
i.e. such that (differentiating the left-hand side using the product rule) 
ot) ay 1a) = 1a) + I(t) p@)x(t). 
dt dt dt 
For this we would need 
d/ 
eG) aR); 


which is certainly true if 


ae 
qn 


This is a separable equation (see Exercise 8.4); we can divide by J and multiply 
up by df to give, 


1 
7 = pia) dt, 


and then by integration we get 


In |1(t)| = f roar, 


Finally we exponentiate both sides and choose / (t) to be positive to give 


I(t) = exp (| p(t) ar) : 


Given this integrating factor we should now be able to solve our general linear 
equation 


a thx = q(t 9.8 
ap 1 PO = 4). (9.8) 


If P is an anti-derivative of p (so that P(t) = p(t)) then the integrating factor we 
need is e? , Multiplying both sides of (9.8) by this integrating factor the equation 
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becomes 
d 
5, oO + pela e? =aieP®. 


The point of the integrating factor is that the left-hand side is now the derivative of 
x(t)e?™, so we have 


d 
qe l= amet. 


In order to solve the problem completely we have to be able to integrate the right- 
hand side, and then the solution is 


x(tjeP = / q(tje?™ dt 


We now apply this method to some examples. 


9.3 Examples 
Example 9.1 Solve the equation 
dx 
dt 
The integrating factor is 


+3x=t with x(0) = 8/9. 


I(t) = exp (f ser) =e, 


Multiplying both sides of the equation by e*’ we get 


dx 
aes + 3xe = te". 
dt 


The whole point of the method is that we can now rewrite the left-hand side as a 
derivative: 


d 3t 3t 
pars —d t : 
se )=te 


Integrating this equation with respect to ¢t between O and f — using integration by 
parts on the right-hand side —we obtain 


x(t)e*’ — x(0) = i) pe: di 


0 


lI 
| 
~ 
| @ , 
= 
| 
0 | re 
Ms 
— 
<j 
ll 
S 
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and so, since x(0) = 8/9, 
t 1 


—3t 
th= --— 7. 
x(t)=e +3 9 


Example 9.2 Find the general solution of 
d 
@ + iy + 4xy = 12x. 
dx 


This is a linear equation; if we divide both sides by x? + 1 then 


rae Ax — 2x 
dx a2aa°  x2bi 


which is in the form (9.1). The integrating factor is 
I(x) / ar 
= eX —=— 
Xx p ere Xx 
= exp(2In(x* + 1)) = (x7 + 1)”. 
So, multiplying both sides by (x? + 1)? we get 
2 2dy 2 2 
(x* + 1) qe + 4x(x* + 1)y = 12x(x* + 1), 
x 
which is 
d 2 2 2 
—[y(x* + 1)*7] = 12x(x* + 1). 
dx 
To find the general solution we integrate both sides to get 
y(x)(x? + 1)? = 3(x7 +1)? +c 


and so 
Cc 


y(x) =o a eer aay 


9.4 Newton’s law of cooling 


An interesting example of a linear equation arises from Newton’s law of cool- 
ing, which provides a mathematical model of the temperature T(t) of an object in 
surroundings of temperature A(t): 


ian k(T — A(t 9.9 
>| ani. — A(t)), (9.9) 


where k > O measures the rate that heat is absorbed (or emitted) by the object. 
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9.4.1 Estimating the time of death 


One forensic method for ascertaining the time of death of a body is based on 
Newton’s law of cooling. The idea is to take the temperature of the body at 
two different times, in order to give an estimate of the constant k to be used in 
equation (9.9), and then to extrapolate back to find the time when T is the temper- 
ature of a living body, 37 °C. 

To keep things simple we will suppose that a body is found in a room which is 
kept at a constant temperature of 24 °C. At 8 a.m. in the morning its temperature 
is 28 °C, while an hour later it is 26 °C. 

With the time ¢ measured in hours we need to find the solution of 


dT 
—+kT=kA 
dt 


(in fact we have done this already in Section 9.1). Multiplying both sides by the 
integrating factor e“ we obtain 


d k 
—[T (te — kAe*, 
q aif (r)e" | 
and then integrating both sides with respect to f between times t and fy gives 
T (t)e*2 — T(t)e™! = A(ek2 — e1); 


rearranging this gives the temperature at time f in terms of the temperature at 
time f1, 


Ty) =A+[T(H) — Ale WO. 
This implies that 
T(9) = A+[T(8) — Ale“. 
To find k we set T(8) = 28, T(9) = 26 and A = 24, so we then have 
26 = 24 + [28 — 24]e*, 
which implies that 
e*—0.5 


giving k = In2. 
If the time of death was fp then our solution gives 


T (8) =A+[T(to) — Ale *8-, 
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Since T (to) = 37, T(8) = 28 and A = 24, we want to find the value of to such that 
28 = 24 + [37 — 24]e*C-0), 
1.e. 
4 = 13e*8-0) | 
Taking logarithms gives 
In4 = In 13 —k(8 — 49), 


and using k = In2 we have 
In4 — In 13 
—In2 


Solving this for tg we obtain fo © 1.7, putting the time of death at approximately 
1:42 a.m. 


(8—f) = 


9.4.2 The temperature in an unheated building 


We now look at a case where the ambient temperature is not taken to be constant. 
To make things more definite, and a little less gruesome, we suppose that T(r) 
represents the temperature inside an unheated church. 

Once again we want to use the integrating factor method, so we rewrite equa- 
tion (9.9) as 


a +kT =kA(t); 
dt os 
as before, the integrating factor is e“’. Multiplying both sides by e*’ gives 
d k 
—[Te] = kA(re™, 
qe | (t)e 
and then integrating between 0 and f we get 
: ~ oe ~ 
T (the — T(0) = Kf A(ie™ df. (9.10) 
0 
Rearranging this gives 
t : 
T(t) =T(O)e™ +ke™ / Ae" di. 


0 


Now we model the outside temperature as a regular oscillation about an average 
temperature ju, setting 


A(t) = uw+acosat. 
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Fig. 9.1. The graph of A(t) = u+ acosat against t, showing the period and 
amplitude. 


We say that A(t) oscillates about a mean value jz; the size of the oscillations, a, 
is called the amplitude, and the frequency of the oscillations is how many occur 
for every unit of time, in this case w/27. The reciprocal of the frequency, 277/a, 
is known as the period: this is the time between any two successive maxima of 
A(t), 1.e. the time for one ‘complete oscillation’. These concepts are illustrated in 
Figure 9.1. 

Over a British year we can take the mean temperature as 9 °C, with the ampli- 
tude of oscillations 10 °C. With ¢ measured in years we then want w = 27 so that 
the oscillations have a period of one year. 

With this particular form for A(t) equation (9.10) becomes 


t 


t . is 
T (the —T(0) = ut | et ai +ak | e cos wi di. 
0 0 


An anti-derivative of eX cos wt is! 


ae cos wt + =——e" sin wt (9.11) 
k? + @ k? + 
and so we have 
kt ki ; k ki 7 w ki Pal 
e“ T(t) —T(O) = e + ak} ———e™ cos wt + ————e” sin wt 
ee H| Mes f== k? +o? im 


k 
= w(eX —1)+ak Feeerc cos wt + zat sin wt — 
0) @ 


! You might expect the anti-derivative to be of the form Ae* cos wt + Be* sinat, since differentiating the first 
of these two terms gives one term involving e* cos wt (which we want) but also an e“ sin wt term. Differentiat- 
ing this guess gives an equation for A and B which is straightforward to solve. There is also a more systematic 
way of obtaining this result using the complex form for cos wt and sin wt, see Exercise 9.8. 
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This is looking complicated, but we can rearrange it, in particular multiplying 
through by e~*", to give 


ak? | 4, k w 
T@H=ut [ro -w- zeal + ak] po c0s.ar + oP sinor| 

This still looks complicated,” but if we consider each term individually then we 
should be able to understand what this solution is actually saying about the temper- 
ature. The first term is the average outside temperature, which we would expect to 
form the main contribution to the temperature inside the church; the second term 
decays exponentially, so will have very little effect after some time has passed; and 
the last two terms both oscillate with the same frequency as the ambient tempera- 
ture. Before we write the equation in its final form we will see in the next section 
how to combine these two oscillating terms in order to make it clear that they result 
in just one oscillation. 


9.4.3 Combining two oscillating terms 


We are now going to show that it is possible to combine two oscillating terms and 
rewrite them as one: 


Acosat + Bsinwt = M cos(at — ¢). (9.12) 


(In our case the constants A and B are given by A =k/ (k* +7) and B= 
w/(k* + w”), but the argument is much easier to follow with the more general A 
and B.) 

The idea is to use the double angle formula? 


cosa cos 6 + sina sin B = cos(B — a@) (9.13) 


in an appropriate way. Note that the left-hand side of (9.12) looks slightly like the 
left-hand side of (9.13) if we choose B = wt; it would look just like the left-hand 
side of (9.13) if we could find a @ such that 


cos@=A and sing = B. (9.14) 


In general we are unable to do this, because we know that, whatever the value of 
0, cos? @ + sin* 6 = 1, and there is no reason why A* + B? should be equal to 


2 if you solve an equation and end up with a long expression like this then there are various ways that you 
can check your answer. You always have the option of differentiating and substituting back into the original 
equation to check that it works; sometimes this itself might be daunting. One thing that you can do quite quickly 
here is to check that at t = 0 the right-hand side reduces to T (0), as it should. 

3 The right-hand side could also be written as cos(a — 8), since cos 6 = cos(—8). 
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one. The way to circumvent this is to take out an appropriate factor from (9.12) 
and write 


Acosot + Bsinat = Var B| 


cos wt + 


A B . 
——————————— —————_ sin at |. 
J A2 + B2 J A2 + B2 | 


The coefficients of cos wt and sin wt within the square brackets now satisfy 


A ‘ B : 
OF + as = 1, 
(a) (jam) 
and so we can find a @ with 


A B 
cos ¢ = —————— and sin dé = ————_., (9.15) 
VA2 + B2 A? + B? 


which is just given by 
¢@ = tan” !(B/A). 
(One way to think about this is in terms of the right-angled triangle pictured in 
Figure 9.2.) 
With this choice of @ we now have 
Acosat + Bsinat = vy A? + B? (sS cos wt + eee sino) 
/ A2 + B2 / A2 + B2 
= V A? + B’ (cos¢cos at + sing sin wf) 
= / A? + B? cos(at — ¢). 


The sum of the two oscillations has amplitude M = V A? + B? and oscillates at 
frequency w/27, with a time lag of d/w. 


(A? + B?) 


A 
Fig. 9.2. A, B, ¢, and a right-angled triangle. 
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9.4.4 Back to our example 


In our particular example, we want to combine 


k 
nae) cos wt + 7 sin at; (9.16) 
(0) oO 


so if we take out the factor 


k2 ts w = eee od 
(ke +0°)? (ke +0°)?  \ (ke? +07) Vk? 40?’ 


and choose ¢ with 


o = tan !(w/k), 


then the two terms in (9.16) become 


| 1 
Pee cos(wt = p), 
and the full solution is 
T(t) =ut+/TO-p- a ees ak,| ee cos(wt — @). (9.17) 
=e 24 @? k2 + w : , 


For a particular choice of parameters this solution is shown in Figure 9.3, along 
with the ambient temperature. 


° 


temperature T( C) 


Fig. 9.3. The solid line is the solution (9.17) when A(t) = 9+ 10 cos 2tt, k = 3 
and T (0) = 12. The ambient temperature A(t) is shown as a dashed line. 
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Now we can see clearly what the solution describes. The underlying temperature 
is the average ambient temperature jz; this is adjusted by a term that decays to zero 
as t — oo, and an oscillating term. When ¢ is large the decaying term will be small, 
and the solution will be approximately 


| k2 
Tapprox (t) =U +a ae cos(wt = a); 


eventually the temperature oscillates at the same frequency as the ambient temper- 
ature, but its amplitude is a multiplied by the factor 


k2 
6 9.18 
k2 + @2 on 


Since cos(@t — a) = cos w[t — (a@/q@)] there is a time lag between the oscillations 
of the ambient temperature and the oscillations of T(t) which is 
Of eet (9.19) 
Oo Oo k 
Note that if the constant k is large, so that heat is easily conducted from the 
surroundings to the building, the factor in (9.18) will be close to 1, while the time 
lag will be close to zero; in such a case T(t) follows A(t) very closely. 


Exercises 
9.1 Use an integrating factor to solve the following differential equations: 
(i) 
d 
2 dee gt 
dx x 


(find the general solution and the only solution that is finite when x = 0), 


(ii) 
eS +tx =4t 
dao 
(find the solution with x(0) = 2), 
(iii) 
dz : 
— =ztany+smny 
dy 
(find the general solution), 
(iv) 
yte*y=1 


(find the solution when y(0) = e, leaving your answer as an integral), 


9.2 


9.3 


9.4 


9.5 
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(v) 
x +x tanht = 3 


(find the general solution, and compare it to that for x + x = 3), 


(vi) 
y’ + 2ycotx = 5 
(find the solution with y(7/2) = 1), 
(vii) 


cane t 
wei Pea 
dt 


(find the general solution), 
(viii) with a > 0 find the solution of the equation 


for a general initial condition x(1) = xo, and show that x(t) > b/a ast > c 
(you would get the same result if you replaced a + t~! by a). 
A body is found in a cold room (temperature 5 °C) at 3 p.m. and its temperature then 
is 19 °C. An hour later its temperature has dropped to 15 °C. Use Newton’s law of 
cooling to estimate the time of death, assuming that body temperature is 37 °C. 
At 7 a.m. in the morning I make my wife a cup of tea using boiling water; after adding 
some milk it is about 90 °C. When we leave for the station at 7:30 a.m. the tea is still 
drinkable at about 45 °C. When I get back home at 8 a.m. the neglected tea has cooled 
to about 30 °C. What is the temperature of our house? 
Use the integrating factor method to find T (t2) in terms of T(t;) when 


dT 
ap ee 
t 


and 
A(t) = u+acosa(t — ¢). 


A dead body is found outside on a winter’s morning at 7 a.m.; its temperature is mea- 
sured as 20 °C. Measured an hour later it has dropped to 15 °C. The air temperature 
A(t) fluctuates on a daily cycle about a mean of 3 °C with A(t) = 3 — 5cos w(t — 2), 
where f is measured in hours with t = 0 corresponding to midnight, and w = 7/12. 
(i) Use the solution from Exercise 9.4 and the temperature observations at 
7 a.m. and 8 a.m. to show that 


are 12(k2 + w2) — 5k(k cos 6@ + w sin 6w) . ois 


17(k2 + w2) — 5k(k cos 5@ + w sin 5a) 


(11) (C) This is a MATLAB exercise. Choose an initial guess for k, and then substitute 
this into the right-hand side of (E9.1) to obtain a new guess. Continue doing this 
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9.6 


9.7 


9.8 
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until your ‘guess’ stabilises. Once this happens you have actually obtained the 
required solution of (E9.1). Can you see why? (You should find that k ~ 0.3640.) 

(iii) If the time of death was fo, use the fact that body temperature is 37 °C (so T (to) = 
37) and T (7) = 20 to show that 


saci 17(k* + w*) — 5k(k cos 5@ + w sin 5a) 
— n 5 
° k | 34(k?2 + w2) — 5k(k cos (to — 2) + wsina(to — 2)) 


(iv) (C) Use MATLAB again to refine an initial guess for the time of death as in part 
(ii). You should find that f9 + 4.8803, or 4:53 a.m. 
Show that if yj and y2 are any two solutions of 


dy 
ae + p(x)y =0 
Xx 


then y; (x)/y2(x) is constant. (You do not need to solve the equation!) 
(T) Suppose that 


(this is known as a differential inequality). Use an appropriate integrating factor to 
show that 


d —at 
ues < 0, 
qe aS 
and then integrate both sides between appropriate limits to deduce that 
x(t) < x(sjet 


for any t and s. Hint: it is a fundamental property of integration that if f(x) < g(x) 


then 
b b 
/ fix)dx < / g(x) dr. 
a a 
(T) The function sin wt can be written as a combination of complex exponentials, 
elot = e viet 
sin ot = - 
21 


Using this form for sin wt, and assuming that the usual rules of integration apply to 
such complex exponentials, find 
i e™ sin wt dt. 


You may also need to use the identity 
elot abe eT iat 
2 


See Appendix A for more on these complex exponentials. 


cos at = 


10 


Two ‘tricks’ for nonlinear equations 


This chapter deals with two tricks that can be used to solve certain nonlinear equa- 
tions. Since these techniques can only be used for equations of particular kinds, it 
is important to be able to spot them. 


10.1 Exact equations 


Suppose that x and y are related implicitly by 
F(x, y) =c, (10.1) 


so that x and y form a ‘curve of constant F’. Then if we take the derivative of 
(10.1) with respect to x we get, using the chain rule (see Appendix C), 


OF OF dy 
3—&,y) + ——G, y)— = 0. (10.2) 
Ox oy dx 


This is a differential equation for y(x), whose solution is the implicit equa- 
tion (10.1) that we started with. 

The nice thing is that there is an easy way to check whether or not a differential 
equation 


d 
fo. y) +e@. y= =0 (10.3) 


is one of these “exact equations’ (so called since the equation is exactly the deriva- 
tive of the function F(x, y) with respect to x). Notice that if we compare (10.3) 
with (10.2) then we would need 


OF 


oF 
fQ@,y)= i and = g(x, y) = ——. (10.4) 
x dy 
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Since the order of taking two partial derivatives does not matter (see Appendix C 
again), if we have (10.4) then we will have 


df OF a-F ag 


oy : Ox dy 7 dy Ox Ox’ 


a a 
a oe (10.5) 
dy Ox 
This equation (10.5) is in fact a necessary and sufficient condition for the orig- 
inal equation to be ‘exact’, i.e. for there to be a function F(x, y) such that (10.4) 
holds. We prove this by showing how to find such a function F under condition 


(10.5). First we want to make sure that the first equation in (10.4) holds, i.e. 


OF 
ae >, f(, y). 


We want to reverse the partial differentiation with respect to x. Remember that 
when we perform a partial differentiation with respect to x we have to keep y 
constant; so any function of y alone, C(y), behaves as a constant would if we were 
carrying out an ordinary differentiation (i.e. it disappears). It follows that when we 
integrate this equation with respect to x, the ‘+c’ term could depend on y: 


F(x, y) = f fe.yas + C(y). (10.6) 
In order to fix C(y) we partially differentiate (10.6) with respect to y, 
OF 0 dc 
peveat eet _y)d sare 
dy | fe rr dy 


(we have an ordinary derivative in the last term since C only depends on y) and 
now we can use the second equation in (10.4) to get 


a) 
a OI) = a / f(x, y) dx. (10.7) 
dy dy 
The condition in (10.5) means that this expression for dC/dy only depends on y, 
since 
0 a) dg ~~ | 
— (gx. y)-— ,y)dx) == - d 
w (se- > f re.nax) = my P(x, y) dx 
dg 
= — —- —— d 
Ox x Ox = f fe es 
dg of 


ox dy 
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By integrating the right-hand side of (10.7) we will find C(y) up to an arbitrary 
additive constant and so obtain the solution. 


Example 10.1 Check that the equation 


d 
x4 %4 24x) =0 
x ~~ — dx 
— 


fay 


(valid for x > 0) is exact and hence find its solution. 


First we calculate 


of 1 dg 1 
—=- and —=-, 
Oy x Ox Xx 


and so the equation is exact and F(x, y) = c for some F with 


OF OF 
Bee fa and ie yr ie, 
Ox x oy 


Integrating 9F /dx = x? + (y/x) with respect to x we get 


4 
Xx 
EGY = tyne Fe), 
To find C, we differentiate this partially with respect to y, 
dc 
Inx + — = y? + Inx, 
dy 


and so C’(y) = y” which implies that C(y) = y?/3; we can omit the constant 
of integration since this can be absorbed into the ‘c’ that occurs in the resulting 
solution: 

y? 

= SC; 


x4 
LOD ey ar 


You cannot rearrange this to solve for y as a function of x; the best you can do 
is to have the solution in this implicit form. 


10.1.1 Integrating factors 


It may be the case that an equation is not exact, but can be turned into an exact 
equation if it is multiplied by the correct integrating factor. 
We have already done this for the simple case of the linear equation 


dy 


= 0). 
ag ee 


92 10 Two ‘tricks’ for nonlinear equations 


Clearly this equation as it stands is not exact, since 
0 a) 
a |p@)y] = p@)40=—[). 
oy Ox 


However, if we multiply by the integrating factor e?), where P is an anti- 
derivative of p, then the resulting equation 


d 
p(xjePy 4 POD = 
dx 
is exact, since 
d P(x) P(x) _ 8 Pa) 
—lpwe “yl= pe =e 
dy 0 


Of course, linear equations are a very special case, and in general there is no 
simple way to find an integrating factor. Suppose that we start with the more gen- 
eral equation 


dy 
FRY) + 8%, WT = 0, (10.8) 
Xx 


and try to turn this into an exact equation by multiplying both sides by /(x, y). 
Then we obtain 


d 
f(x, yI(x, y) +a, yl, y= = 0, 


and for this equation to be exact we need 


0 0 
ay y)I (x, y)] = —Ie@, yl, y)], 
y Ox 


or 


(= 7 *) peg ag (10.9) 
dy Ox 


This is a partial differential equation for 7, and is certainly no easier to solve than 
the original equation (10.8). 

However, there are situations in which it is possible to simplify (10.9). For ex- 
ample, suppose that we assume that there is an integrating factor J that depends 
only on x. Then (10.9) becomes 


dl 1/af a 
= (2 = an (10.10) 
dx g\dy 0x 
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This equation will have a solution that depends only on x provided that 


= & = t) (10.11) 


depends only on x. 


Example 10.2 Find an integrating factor depending only on x that will make 


2ye* d 
3 sin y + Sye™* + —— + (xcosy +) = =0 
a 


7 g(x) 


an exact equation, and hence find its solution. 


There will be an integrating factor that depends only on x if (10.11) holds. This 
gives 


1 of dg 1 5 2e* : 
Set ge re se | 5e* SS 5e* 
(Z se) += | cos y + 5e" + a (cos y + 5e°") 


I ( =) 
= ————__~— [2cosy+ 
xcosy +e* x 


So, from (10.10), we need 


di 21 
dx x. 
Separating variables gives 
ldi 2 
Idx x 
and so 
InJ = 2Inx, 


which implies that J (x) = x”. Multiplying the equation by x? yields 


d 
3x” sin y + 5x7ye™ + 2xye* + (x3 cos y + x7e™) 7 = 0, 
_————————— ee ee ib 
f(x) 8(x) 
which is now exact, since 

af ag 

af = 3x7 cos y + 5x7e* 4+ 2xe* = oe 

oy Ox 
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To find F(x, y), we first integrate 
OF 


ae = f(x,y) = 3x7 sin y + 5x7ye™ + 2xye* 
x 


partially with respect to x to give 
F(x, y) =x? siny + x*ye* +C(y). 


To fix C(y) we differentiate F partially with respect to y, 


OF 
nae x cos y + x7e* + C'(y). 
y 


We therefore have dF /dy = g(x) if C’(y) = 0. So we finally have our solution, 


F(x, y) =x°siny +-x7ye* +C =0. 


10.2 Substitution methods 


In some cases it is possible to simplify an equation considerably by making an 
appropriate substitution. Just as with integration, knowing what this ‘appropriate 
substitution’ might be is not always clear, and given a general differential equation 
it is probably not the case that it can be solved by a clever substitution. However, in 
this section we cover two types of first order equation that can be easily identified 
and then solved in this way. 


10.2.1 Homogeneous equations 


A first order differential equation is said to be homogeneous! if it can be written in 


the form 
d 
ee ae (=) 
dx x 


In this case we can make the substitution u = y/x. So then y = ux and using the 
product rule gives 


so that 


which is a separable equation. 


! We also use this word in a different, but related, sense to describe a linear equation of the form ajd" y/dx” + 
+++ + agy = 0, see Section 3.3. 
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Example 10.3 By means of an appropriate substitution solve the equation 
dy 4.2 Z 
xy— = 2x°+3y". (10.12) 
dx 
Dividing both sides of (10.12) by xy gives 
dy 


9" 43 


bs 
dx y x? 


and the right-hand side is a function of u = y/x, 


2 
F(u) = —+3u. 
u 


We substitute u = y/x; therefore y = xu, and so y’ =u + xu’. Thus 


which gives 


du 2 42 
tS u 
dx u 
Separating the variables we get 
2 
e du = —dx, 
1+u? x 


and so 


u 2 
[pas [ro 
1+ u2 x 


5In(1l+u?) = 2In|x| +c, 


which gives 


or 
V1 +u2 = Ax, 
ie. u(x) = +VA2x4 — 1. Since y(x) = xu(x) the final answer is 
y(x) = +xVax4 — 1, 


where we have replaced A” by a > 0. You might like to check that this really is 
the solution of (10.12). 
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10.2.2 Bernoulli equations 


Another type of equation that can be solved by substitution is the so-called 
Bernoulli equation, 


dy . 
ay P(Xx)y=q(x)y". (10.13) 
x 
When n = 0 or | this is just a linear equation. For n taking other (perhaps negative) 
values this falls into none of the classes we have considered so far. 


However, the substitution wu = y!~” turns (10.13) into a linear equation: 


aie Gs ayy 

dx dx 
= (1—n)y "[—p(x)y + q(x)y"] 
= (1 —n)[—p(x)y'" +(x) 


= (1 —n)[—p()u + q(x)]. 
The resulting equation for u is 


du 
Pe ee ee 
Xx 
a linear equation that we can solve using the integrating factor method of Chap- 


ter 9. 


Example 10.4 Use an appropriate substitution to find the general solution of 


d 

ee 6xy = 2xy?. (10.14) 
dx 

This is clearly of the form in (10.13) with n = 2, so we set u = vt Then we 


have 


du _—s 1 dy 
dx sy dx 
6 
pete — 2x 
y 
= —6xu — 2x, 
or 
d 
aa re —2x. 
x 


The integrating factor for this equation is 


I(x) = exp (/ 6x ax) — exp(3x?). 
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Multiplying both sides by e3*” we have 


Yue") = —2xe3*”, 
dx 
Integrating both sides with respect to x gives 
1 
u(x)e = ae +c, 
and so 


1 3,2 
u(x) = =3 +ce ; 


Since y = 1/u we have 


3 


ya)y= Ae 


Again, you might like to check that this really is a solution of the original differ- 
ential equation. 


Exercises 


10.1 Check that the following equations are exact and hence solve them. 


(i) 
2: 2 dy 
(2xy — sec” x) + (x* + 2y)— =0, 
dx 
(i1) 
x x x dy 
(l+e*y+xe*y) + (xe 5 an 
(iii) 


d 
(x C08 y + cos x)= + siny — ysinx = 0, 
x 


and 
(av) 
xX os x y dy 
e* siny+y+(e cosy +x +e") = 0. 
x 
10.2 Find an integrating factor depending only on x that makes the equation 
d 
e > secx + 2cotx — er = 6 
x 


exact, and hence find its solution. Hint: f cosec x dx = In |cosec x — cot x|. 
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10.3 Show that any equation that can be written in the form 


dy 
f(x)t+ By =0 
XxX 


is exact, and find its solution in terms of integrals of f and g. Hence find the solutions 


of 
(1) 
d 
V(x) +2y— =0 
dx 
and 
(ii) 
1 d 2 
( «) ud + b=0, 
y dx x 
forx,y >0. 


10.4 By substituting u = y/x solve the following homogeneous equations: 
(i) 

ndy 

dx 


Sy a SK =0 


(the solution is y = x tan(In|x| + c)). 
(i1) 
dx x x? + t/t? + x? 


dt tx 


(the solution is x(t) = +f (In |t| + c)? — 1). 


10.5 You could solve 


d 
aires kx — x”. 
dt 


by separating variables (see Exercise 8.7). Instead, substitute « = x~! and show that 
u satisfies the linear equation 


Solve this equation for u(t), and hence find the solution x(¢). 
10.6 Use an appropriate substitution to solve the equation 


x= x(k? _ x). 


You should recover the solution (8.16) found by separating variables. 


Part II 


Second order linear equations with constant coefficients 


11 


Second order linear equations: general theory 


We will now turn to second order differential equations, 


d*x . 
qe fe D: (11.1) 


In this chapter we address the kinds of theoretical question that we covered for first 
order equations in Chapter 6. As such there are few examples, but we will return 
to more concrete problems and solution methods in the next chapter. 


11.1 Existence and uniqueness 


First we discuss the existence and uniqueness of solutions. Before we do this for- 
mally, we give an indication of why we will need to specify both x and x in our 
“initial condition’. 

Consider the simplest type of second order equation, 


d?x 
rida ADs (11.2) 


the second order equivalent of the ‘trivial’ equations we considered in Chapter 5. 
We can solve (11.2) by integrating twice: if F is any anti-derivative of f then 


dx 
—@H=FO+e (11.3) 
dt 
and then if F is any anti-derivative of F 
x(t)=F(t)+tect+oo. 


The two integrations result in two arbitrary constants (c; and c2); specifying x (fo) 
alone will not be enough to tie down the solution, but we need to specify x (fo) (to 
fix cy) and then x(fo) (to determine cz). 
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To put this in a physical context, equation (11.2) is the equation of motion for 
a particle moving under the influence of a force f(t) per unit mass. In order to 
predict the motion of the particle in the future we need to know both its current po- 
sition and its current velocity (e.g. the difference between dropping and throwing 
a piece of chalk). 

We now state the existence and uniqueness theorem. As with our previous ex- 
istence and uniqueness theorem (Theorem 6.2) it can be paraphrased as “unique 
solutions exist provided that f is sufficiently nice’. 


Theorem 11.1 Given a function f (x2, x1, t), suppose that f, df /dx, and df /dx2 
are continuous functions for a, < x, < a2, by < x2 < bo and t; <t < to. Then 
for all initial conditions 


x(to) = xo and x(to) = yo (11.4) 


with ay < x9 < a2, bi < yo < b2 and t) < to < t2 there exists a unique solution 


of 
¥= fxd (11.5) 


on some interval I containing to, i.e. a continuous function with two continuous 
derivatives that satisfies (11.4) and the equation (11.5) on I. 


11.2 Linearity 


In the following chapters (Chapters 12—20) we will concentrate on linear second 
order equations, the most general form of which is 


d?x dx 
a2(t) a + ai(t)—- + ao(t)x = g(t). 


Most of the time during any general treatment we will assume that a2(t) 4 0, 
divide by az, and rewrite the equation as 


d*x dx 
qe eae +q(t)x = f() (11.6) 
(cf. (3.5)), since this saves a little algebra. 

When we have not specified the initial conditions for (11.6) the equation will 
not have a unique solution, and we would expect its general solution to have two 
arbitrary constants so that we can fit any pair of initial conditions, x (to) = x9 and 
X (to) = yo. We now investigate this a little further, starting with the homogeneous 
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problem 
d2x 
di? 
(recall that (11.6) is called ick: when f(t) = 0). 


Our first observation is crucial: if x;(t) and x2(t) are two solutions of (11.7) 
then so is 


+ro> “+ q(x =0 (11.7) 


X(t) = ax, (t) + Bx2(t) 


for any choice of two real numbers @ and £. This is known as the principle of 
superposition of solutions. To see this, first note that 


dx dx dx2 d2x d2x1 d2x%5 
- d = ; 
arr cre an 7 ae 7 Ba 
therefore 
25 dx} 
de +o + q(t)x =a i + pO +q(x1 
d?x2 
+B 2 + py + gin =0. (11.8) 


We can express this in a more elegant way if we are prepared to define some extra 
notation. Although this might appear complicated, all that we are going to do is to 
define a shorthand so that instead of always having to talk about 


d2x 
dt dre 
we can refer simply to L[x]. 


Given a function x(t) that has two derivatives we simply define L[x] to be 
(11.9), i.e. the left-hand side of our equation: 


+o > “+ q(x (11.9) 


d?x dx 
L{x](t) = ae Pg eee): (11.10) 


The argument ‘(t)’ has been included here to emphasise that L[x] is a function of 
t; starting with x(t), which is itself a function of t, L[x](t) gives another function 
of t. This Z is referred to as a differential operator, since it ‘operates’ on x(t) by 
performing various differentiations. We will be somewhat more rigorous about the 
definition of L in Section 11.5. 

Now we can express our observation about adding multiples of solutions by 
saying that the operator L is linear; this simply means that 


Llax, + Bx2] = aL[x1] + BL[x2] (ITT) 
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for all choices of a, 8 € R and functions x; and x2 that have two derivatives (so 
that L[x,] and L[x2] make sense). If we write equation (11.11) in full using the 
definition of L then it gives precisely the first equality in (11.8). It is this property 
of linearity which makes linear equations so much easier to solve than nonlinear 
ones. 


11.3 Linearly independent solutions 


We will now see that in order to find the general solution of (11.7) we have to find 
two ‘different’ solutions x;(t) and x2(t); from these we will be able to construct 
any solution x(t) as a linear combination 


X(t) = ax) (t) + Bxo(t). (11.12) 


11.3.1 Linear independence of functions 


At the moment we have no well-formed idea of what it means for two solutions 
to be ‘different’. The appropriate notion is that the two solutions are linearly inde- 
pendent, an idea borrowed from vector algebra. The n vectors vj, ..., Up are said 
to be linearly independent if the only solution of 


ayvpt+---+ anv, = 0 


is dj] =Q). =--- =a, = 0, Le. if none of them can be written as a linear com- 
bination of the others. We now make the same definition replacing the vectors 
V],..., U, With functions x;(t), ... , Xn(t). 


Definition 11.2 The functions x,(t), ..., X(t) are linearly independent on an in- 
terval I if the only solution of 


ay xy(t) +--+ tanxp(t) =0 for all tel 
isd] =::-=a, = 0. 


When we have only two functions their linear independence simply says that 
they are not proportional on /. If x;(¢) and x2(t) are proportional on / then for 
some constant c 


x(t) = cx (t); 


it follows that cx1(t) — x2(t) = 0 on J, and so x; and x2 are linearly dependent. 
Conversely, if x; (t) and x2(t) are linearly dependent on J then for some non-zero 
a, and a2 


ax, (t) + a2x2(t) = 0 for all tel 
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which implies that 
a2 
x1(t) = ——x2(t), 
ay 


i.e. x; and x2 are proportional. 


11.3.2 Two linearly independent solutions are necessary 
and sufficient 


First of all, we will see that it is not possible to obtain all possible solutions of 
pO Par 0 (11.13) 
‘eo ap eS 


(equation (11.7)) as multiples of a single special solution. Suppose that x (¢) is the 
solution of (11.13) satisfying 


x1 (to) = 1 and x1 (to) = 0, 
while x2(f) is the solution that satisfies 
x2(to) = 0 and x2(to) = 1. 


Using Theorem 11.1 both these solutions exist and are unique; but it is clear that 
one cannot be a multiple of the other. This shows that at least two linearly inde- 
pendent solutions are necessary. 

We now see that given two solutions x;(f) and x2(t) that are not proportional 
(like the two just defined above) we can find a and 6 such that the linear combi- 
nation 

x(t) = ax1(t) + Bx2(t) 
(which must also solve (11.13)) satisfies any given initial condition 


x(to) = xo and X(to) = vo. (11.14) 


The correct values of w and 6 can be obtained by solving the simultaneous equa- 
tions 


ax (to) + Bx2(to) = xo 
ax (to) + BXx2(to) = vo. 


Writing these as a matrix equation 


x1(to) x2(to)\ (a) _ (x0 
ee os) (11.15) 
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it is easy to see that we can solve for a and # provided that the matrix on the left- 
hand side is non-singular. This happens whenever its determinant is non-zero (see 
Appendix B). 

We will assume that the determinant is zero, 


x1 (t0)*2(to) — x2(to)*1 (to) = 0, (11.16) 
and deduce a contradiction. It follows from this assumption that! 


X1 (to) i X1 (to) 
X2(to) x2(to) 


=c, say, 


and so 
x1 (to) = cx2(to) and X1(to) = cx2(to). (11.17) 


Because the equation is linear this implies that x2(t) and x;(t) are proportional; 
since x(t) is a solution so is y(t) = cx2(t), and clearly y(t) satisfies the initial 
conditions 


y(to) = cx2(to) = x1 (to) and y(to) = cx2(to) = x1 (to). 


Since solutions are unique, it follows that x(t) = y(t) = cx2(t) for all t. However, 
we chose x; (t) and x2(t) to be two linearly independent solutions, so we know that 
they are not proportional. 

Since our assumption that the matrix in (11.15) is singular has led us to a contra- 
diction, the matrix must be non-singular. Hence it is possible to solve this equation 
and find values of w and 6 such that x(t) = ax1(t) + Bx2(t) satisfies the required 
initial conditions. 

Thus, as claimed, two linearly independent solutions are necessary (we need 
at least two) and also sufficient (two will do) to form any solution as a linear 
combination 


X(t) = axi(t) + Bxo(f). 


11.4 *The Wronskian 


We have just seen that the determinant of the matrix in (11.15) is closely related to 
the linear independence of the functions x;(t) and x2(t). We now investigate this 
a little further. If x; and x2 are linearly independent on an interval J then the only 


! The following line assumes implicitly that x2 (to) 4 0 and that x (to) 4 0. It is certainly not possible that 
x2 (to) = X2(to) = 0, since then the uniqueness of solutions would imply that x2(t) = 0 for all t. If x2(t9) = 0 
(and x2 (to) 4 0) then (11.16) implies that x1 (tg) = 0, and then equation (11.17) follows once more. A similar 
argument yields (11.17) if x2(t9) = 0 and x2(to) 4 0. 
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solution of 
ax,(t) + Bx2(t) = 0 for all tel (11.18) 


should be a = 6 = O. If (11.18) holds for all t € 7 then we can differentiate and 
obtain a second equation 


ax,(t) + Bxo(t) = 0 for all tel. 


Putting these two equations together we obtain the matrix equation 


ex Be eles 
xi(t) w20t)/\B/ \Os° 


If the matrix in this equation is non-singular for some fo € J then we can find 
the solution (@, 6) by multiplying by the inverse of the matrix. This will give 
a = 6 = 0, implying that x;(t) and x(t) are linearly independent on /. 

There is a special name for the determinant of this matrix, the Wronskian of x, 
and x2, written as W[x1, x2](t) (note that the Wronskian is a function of f): 


Xi(t) x(t) 
X(t) X(t) 


We can re-express what we said above by saying that if W[x1, x2](f) 40 on J 
(is not identically equal to zero on J) then the functions x; and x2 are linearly 
independent on /. 

Conversely, in Section 11.3.2 we showed that (11.16) implies that x;(t) = 
cx2(t) forall t € J. Since (11.16) is just W[x1, x2] (to) = 0, we have already shown 
that if W[x1, x2](t) = 0 anywhere on the interval J then x; and x are linearly de- 
pendent. 

Therefore two solutions of a linear second order equation are linearly indepen- 
dent if and only if their Wronskian is non-zero. For some other properties of the 
Wronskian see Exercises 11.2 and 11.3. 


W[x1, x2](t) = = x1 (t)x2(t) — x2(t)x1(¢). 


11.5 *Linear algebra 


Linear algebra is the abstract study of the properties of linear spaces, and linear 
maps between such spaces. Because the subject is abstract it can be daunting, but 
it is its very abstraction that makes it widely applicable. Here we will see that the 
above results about linear differential equations can be very naturally recast within 
the linear algebra framework. Those unfamiliar with the ideas of linear algebra 
should feel free to move on to the next chapter. 

The fundamental concept in the theory is the notion of a vector space; a (real) 
vector space is a collection V of elements along with notions of addition and 


108 11 Second order linear equations: general theory 


multiplication, such that if vj and v2 are elements of V then 
avjptpuEeV for all a,BeER. 


The prime example is the collection of all vectors in R”. 

In order to set our results in this context, the first thing we need to do is to be a 
little more careful about our definition of the linear operator L. In order to do this 
we have to be more precise about the “functions x with two derivatives’ for which 
L[x] is sensible. 

We will denote by C (7) the collection of all continuous functions that are de- 
fined on the interval J. This is a vector space, since if f, g € C°(/) then 


af+BgeC) forall a,feR. 


Similarly, the space C*(/), consisting of all continuous functions on J with con- 
tinuous first and second derivatives, is also a vector space. 
Given a function x € C?(J), the linear operator L defined as in (11.10) by 


d2x 


ToS ee ee 
lS re tae 


is certainly sensible, since x has two derivatives. Assuming that p(t) and q(t) are 
continuous functions, it follows that L[x] € C °”0 ), since x, x, and x are continu- 
ous. 

Thus L is a map from C (7) into C°(/). Furthermore, as we have already seen, 
it is a linear map, i.e. 


Llax, + Bx2] = aL[x;] + BL[x2] 


for any a, B ER. 
The kernel of a linear operator L : E — V (where E and V are vector spaces) 
consists of all those elements of E that are mapped to zero by L, 


ker(L) = {x € E:: L[x] = O}. 


The kernel of our differential operator L consists of all the elements of C (1) for 
which L[x] = 0, i.e. precisely the set of all solutions of the homogeneous equation. 

Now, it is a general result that the kernel of a linear operator is itself a vector 
space, in other words if x; € ker(L) and x2 € ker(L) then 


x; €ker(L) and x2 € ker(L) => ax, + Bx2 € ker(L). 
We have seen this already for our operator L, since it is just another way of writing 


L[x;}]=0 and L[x2.]=0 => Llax, + Bx2] =0 
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(cf. (11.11)), i.e. we can add multiples of solutions of homogeneous linear equa- 
tions and still have a solution (the superposition principle). 

In order to construct the general solution of a homogeneous second order linear 
ODE we have just seen that we need two linearly independent solutions x, (t) and 
X2(t); using these we can construct any solution as 


ax (t) + Bx(t). 


Put another way, any element of ker(L) can be written as a linear combination of 
the two linearly independent elements x; and x2. This says precisely that x; and 
x2 form a basis of ker(L), i.e. a linearly independent spanning set. 

Since the number of basis elements for a vector space is exactly what we mean 
by its dimension, it follows that the dimension of ker(L) is two. Given our defini- 
tion of L, the statement 


dim ker(L) = 2 


is an elegant (albeit abstract) way of saying “we can construct any solution of a 
homogeneous second order linear ODE given two linearly independent solutions 
(and two are needed)’. 


Exercises 


11.1 By finding the Wronskian of the following pairs of functions, show that they are 
linearly independent: 
(i) xy (t) =e! and x(t) = e&! with ky F ko, 
(ii) x1 (t) = e* and x2(t) = te*, and 
(iii) x1 (t) = ef! sinwt and x2(t) = e” cos at. 
11.2 Show that the Wronskian for two solutions x; (t) and x2(t) of the second order dif- 
ferential equation 


d2x dx 


a2 tg, ee (E11.1) 


satisfies 
W(t) = —pi(t)W(t). 


(Write W(t) = x1 (t)x2(t) — x2(t)x1(t), differentiate, and use the fact that x;(t) and 
x2(t) satisfy the equation (E11.1).) Deduce either that W(t) = 0 for all t, or that 
W(t) ~ 0 for all f. 

11.3 We have seen that if x; and x2 are two solutions of a linear differential equation, then 
they are linearly independent if and only if their Wronskian is non-zero. The simple 
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example of this question shows that this is not true for general functions that are not 

the solutions of some differential equation. 

(i) Check carefully that if f(t) = t?|t| then df/dt = 3¢|t| (this is easy when t + 0; 
you will have to use the formal definition of the derivative at t = 0). 

(ii) Let 


filt) = t7|t| and purser. 


Show that although these two functions are linearly independent on R, their 
Wronskian is identically zero. 


12 


Homogeneous second order linear equations 
with constant coefficients 


In this chapter we will find the general solution of the homogeneous linear equation 


ES eee eM (12.1) 
0 ap dt CX =U. ‘ 


From our analysis in the previous chapter we would expect this general solution to 
be of the form 


x(t) = Axi (t) + Bx2(t) 


where x(t) and x2(t) are two linearly independent solutions of (12.1). 
In order to find these two solutions, we ‘guess’ that they are of the form 


x(t) = eM 


and substitute this into (12.1). To see that this is reasonable, first, remember that 


e* is a solution of the constant coefficient first order linear equation 
dx 
—=kx, 
dt 


so that we have already seen exponential functions in the context of linear equa- 
tions. More tellingly, we know that taking derivatives of x(t) = e only multiplies 
x(t) by k. So x, x, and x will all be just some constant times eft ; we should be able 
to cancel the e*'s and all being well end up with an equation that we can solve. 
This is exactly what happens. Since 
2 


d kt kt d kt 2 kt 
— — k d erred = k ’ 
? (e") e an a2 (e") e 


substituting x(t) = e™ in (12.1) gives 


ak?ek + bke* + cek = 0. 


111 
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k 


Since e“ is never zero we can divide by e“ and find that we are left with a quadratic 


equation for k, known as the auxiliary equation, 
ak* + bk +c =0. (12.2) 


Our substitution has reduced the original differential equation for x(t) into a 
simple algebraic equation for k. Since (12.2) is a quadratic equation we stand a 
good chance of finding two roots, which would provide us with the two inde- 
pendent solutions we need to form our general solution. However, although we 
might have two distinct real roots, it is also possible to have only one repeated 
real root, or even two complex conjugate roots (and therefore no real roots). These 
three cases lead to different types of solutions for (12.1) and we consider each in 
turn. 


12.1 Two distinct real roots 
Using the quadratic formula, the solutions of (12.2) are given by 


be Vb? — 4ac 
7 2a , 


k 


Provided that b* — 4ac > 0 we can solve the equation to give two distinct real 
roots, which we will call k, and k2. We have found two different values of k that 
will make our guess e“ a solution of equation (12.1), so we have already obtained 
the two solutions we require; it is clear that if ky # ky then e“ and e'" are linearly 
independent on any interval (we could also check this using the Wronskian, see 
Exercise 11.1). 

The general solution of (12.1) is therefore given by a linear combination of these 
two solutions, 


x(t) = Ae“! + Bel", 


Note that if one of k; and k» is positive (or both of them) then there are solu- 
tions that tend exponentially fast to +-oo, and solutions that tend exponentially fast 
to —oo (depending on the initial condition), both as t — +o. It is only if both 
k, and ko are negative that all the solutions decay exponentially to zero. See 
Exercise 12.2 for more on this. 


Example 12.1 Find the general solution of the equation 
X+x-—6x =0, 
and the solution that satisfies the initial conditions 


x(0) =1 and x(0O) = 2. 
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The equation has x(t) = e*! as a solution when k satisfies 
ke +k—-6=0. 
The roots of this are k = 2 and k = —3, so the general solution is 
x(t) = Ae” + Be~*’. (12.3) 


To fit the initial conditions we substitute in to (12.3) and in to x(t) = 2Ae~! — 
3Be~*' and then solve the resulting simultaneous equations 


A+B=1 2A —3B =2; 


so A = 1 and B = 0 which gives the solution 


x(t) = ent 


If one of the roots of the auxiliary equation is zero then one of the two linearly 
independent solutions will be e°*’ = e® = 1, i.e. a constant. 


Example 12.2 Find the general solution of 
xX —2x =0. 

If we try x(t) = e* for this example then 
k? — 2k =0, 


with roots k = 0 and k = 2, giving the two solutions e®*’ = e° = 1 and e~’. It 
follows that the general solution is 


x(t) = A+ Be”. (12.4) 


12.2 A repeated real root 


When b? = 4ac in the auxiliary equation (ak? + bk + c = 0) we get a repeated 
root k, and so we only obtain a single solution x(t) = e*’. However, we know that 
in order to write down the general solution we need to have two linearly indepen- 
dent solutions. 

In this case the second solution! includes an extra factor of t: it is te. We will 
see in Chapter 17 that it is possible to derive this solution systematically, but for 
now we will just check that it really is a solution. 

When the auxiliary equation has a repeated root we must have b* = 4ac and 
we can use this to put the linear equation ax + bx + cx = 0 into a standard form. 


! For the use of the Wronskian to make sure that these two solutions are linearly independent see Exercise 11.1. 
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Dividing through by a gives 
segs (ey Ae 
tHe HS. 
a a 
Since b* = 4ac this is the same as 
Ct es = a =0 
e Ay age "2a 
and writing A = —b/2a this becomes 
¥-—2i+dx=0. 
If we try x(t) = e* in this equation then k must solve 
k? — 20k +2? = (k—A)? =0, 


and there is indeed just one repeated root k = i. 


(12.5) 


We now try the second solution given above, x(t) = te”. For this we have 


x =e 4+ Ate K = 2re* + a7 te, 
and so substituting into the left-hand side of (12.5) we get 
(ie! 4237767) = Die + Ate) 7G”) 


which is zero. 


So the two linearly independent solutions that we get in the case of a repeated 


root k are e*" and te“, and the general solution is 


x(t) = (A+ Bre™. 


(12.6) 


Clearly if k > 0 then solutions tend to infinity (+00 or —oo depending on the 
initial condition), while if k < 0 the solutions decay to zero; te* tends to zero if 


k < 0 (see Figure 12.1). 
Example 12.3 Find the solution of 
X+2x+x=0 
that satisfies 
x(0) =0 and x(0) = 1. 
The equation has a solution x(t) = ek! if k solves 


ke 4+2k+1=0. 
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Ll 
0 5 10 15 


Fig. 12.1. The graph of te~’ against t: despite an initial increase the function 
decays to zero as t > oo. 


This equation is (k + 1)* = 0, so there is a repeated root k = —1. Two independent 


solutions of the equation are e~’ and te~', so the general solution is therefore 


x(t) = (A+ Bre“. 
In order to satisfy the initial conditions we substitute in to find A and B, so we 
need, since x(t) = (B — A)e! — Bte™, 
A=0 B-A=1, 
and the solution in this case is 


x(t) =te', 


whose graph is shown in Figure 12.1. 


12.3 No real roots 


When b? < 4ac the expression within the square root in the quadratic formula is 
negative, and so there are no real roots. Instead we obtain a pair of complex roots, 


b a _v4ac — b2 
i ; 
2a 2a 


For brevity we will write these as k=p+iow (so p=—b/2a and w= 
/4ac — b*/2a). The general solution corresponding to these roots is 


k=- 


x(t) =e?" (Acosat + B sinat). (12.7) 
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The real part of the root gives an exponential, and the complex part gives oscillat- 
ing sine and cosine functions. 

When you come across complex roots in a problem you should just write down 
(12.7). However, we will now see how to derive the general solution (12.7) using 
the complex roots of the auxiliary equation. If you are not happy with complex 
numbers, or just not interested, then you can move to equation (12.9) 

Even though the roots are complex there is nothing wrong with our argument. 
With k = p + iw the expression e* does solve the equation, and so the general 
solution can be written as 


x(t) = CePtiodt 4 pele-io)t 


However, our solution x(t) should be real, so we need to restrict the possible values 
of C and D; since e?—) is the complex conjugate of et!", in order to make 
the whole expression real we want D to be the complex conjugate of C (we write 
D = C%*). Real solutions of the equation are given by 


x(t) = Ce’ Hiodt 4 Cree ia)t (12.8) 


for an arbitrary complex number C = a + if. Note that the solution still has the 
two arbitrary constants (now @ and #) that we would expect. With a little work 
we can rewrite this expression in a form that involves no complex numbers and 
thereby recover (12.7). 

To do this we need to use the fact that 


0 


e’” =cosé +ising 


and that for any complex number z, z + z* = 2 Re(z) (see Appendix A). Going 
back to (12.8) and using these two facts we get 


x(t) = 2Re[Ce ti?" 
= 2e” Re[Ce’] 
= 2e?' Re[(w +if)(cos wt +isinat)] 
= 2e"' (a coswt — B sinat) 


=e?" (Acosat + Bsinwt), 


if we set A = 2a and B = —28. Since @ and £ were entirely arbitrary, so are A 
and B. 

It is worth emphasising again that when you are trying to solve an equation 
and come across a complex conjugate pair of roots p + iw, you should immedi- 
ately write down the general solution (12.9), rather than going through the above 
analysis. 
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The solution 
x(t) =e?" (Acos at + B sinat) (12.9) 


naturally splits into two parts: a factor e?’ outside the square brackets, and an 
oscillating factor inside the square brackets (remember that we have already seen 
in Section 9.4.3 how to combine A cos wt + B sin wtf to give one oscillating term 
M cos(wt — @)). 

All such solutions oscillate, and whether solutions decay or grow depends on the 
sign of p. If o < 0 then all the solutions decay exponentially to zero; if o = 0 then 
the amplitude is constant as we have pure oscillations; and if p > 0 then the solu- 
tion oscillates with an amplitude that grows exponentially fast. These possibilities 
are illustrated in Figure 12.2. 

The simplest example involving complex roots is when k = tiw, known as sim- 
ple harmonic motion; we consider this in more detail in the next chapter. 


Example 12.4 Find the solution of 


K+2x+5x=0 x(0) =1 x(0) =0. (12.10) 


p<O 


T T T T T 


p>o 


Fig. 12.2. Sample solutions of the form x(t) =e? (Acoswt + Bsinwt) for 
(from top to bottom) p <0, o =O and p>0. Each graph shows x(t) 
against f. 
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Kt is a solution if k satisfies 


For this equation x(t) =e 
k? +2k+5=0 
which has roots 


—2+/4-2 
ad apples fe= = —-1+2i. 


k = ———___ 


2 
So the general solution of (12.10) is 


x(t) =e '(Acos2t + B sin2r), 
showing that the origin is stable. Since 
x(t) = e '((2B — A) cos2t — (2A + B) sin2r) 
the initial conditions pick out the solution with 
A=1 2B—-—A=0, 


ie. A=landB= 5 so that 


x(t) = e ‘(cos 2t + 5 sin 21). 


Exercises 


12.1 Find the general solution of the following differential equations, and then the solution 
satisfying the specified initial conditions. 
(i) x — 3x + 2x = 0 with x(O) = 2 and x(0) = 6; 
(ii) y” — 4y’ + 4y = 0 with y(0) = 0 and y'(0) = 3; 
(iii) 2” — 42’ + 13z = 0 with z(0) = 7 and z’(0) = 42; 
(iv) y + y — 6y = O with y(O) = —1 and y(0) = 8; 
(v) y —4y = 0 with y(O) = 13 and y(0) = 0; 
(vi) 6 + 40 = 0 with 6(0) = 0 and 6(0) = 10; 
(vii) y+ 2y + 10y = 0 with y(O) = 3 and y(0) = 0; 
(viii) 2z + 7z — 4z = 0 with z(0) = 0 and z(0) = 9; 
(ix) y+2y + y = Owith y(O) = O and y(0) = —1; 
(x) x + 6x + 10x = 0 with x(0) = 3 and x(0) = 1; 
(xi) 4x — 20x + 21x = 0 with x(0) = —4 and x(0) = —12; 
(xii) y + y — 2y = 0 with y(O) = 4 and y(0) = —4; 
(xiii) y — 4y = 0 with y(O) = 10 and y(0) = 0; 
(xiv) y” +4y’ + 4y = 0 with y(O) = 27 and y’(0) = —54; and 
(xv) } + @*y = 0 with y(0) = 0 and y(0) = 1. 
12.2 If the roots of the auxiliary equation are kj > 0 and —k2 < 0 then the solution is 


x(t) = Ae™! + Be", 


12.3 
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For most choices of initial conditions 
x(0) = xo x(0) = yo 


we will have x(t) — -too as t > oo. However, there are some special initial con- 


ditions for which x(t) > 0 as t — ov. Find the relationship between xo and yg that 
ensures this. 

(T) Solutions of linear equations with constant coefficients cannot blow up in finite 
time; it follows that their solutions exist for all t € IR. To see this, we will consider 


X + px+qx=0 with x(0) = xo and x(0) = yo 


for t > 0 (a similar argument applies for t < 0). By setting y = x, we can rewrite this 
as a coupled pair of first order equations 


x=y 
y= —py— qx. 


Show that 


1d 
aa + y*) = (1—q)xy — py’, 


and hence that 
£ (ty) < Ut lg + 21d? +9. 
Using the result of Exercise 9.7 deduce that for t > 0 
rE yO) = OO): BIO ee, 


showing that finite-time blowup is impossible. Hint: xy < 5 (x? + y*). (The same 
argument works, essentially unchanged, for 


E+ px +(x =0 
provided that | p(t)| < p and |q(t)| < q for allt € R). 
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Oscillations 


In this chapter we look at oscillating mechanical systems, which form one very 
natural class of examples of second order linear equations. 


13.1 The spring 


The simplest system that gives rise to oscillations is a mass on a spring. If the 
spring has a natural length /, then Hooke’s law says the force exerted by the spring 
when it is extended an additional length x is proportional to this extension x. 
So the equation of motion for a mass (with mass m) on the end of spring (see 
Figure 13.1) is 


mx = —kx, 


where k is the “spring constant’. 
If we divide by m and set w* = k/m then this reads 


x= ox. (13.1) 
Equation (13.1) is a linear equation. Trying x(t) = e* we get 
k= 0 


as the auxiliary equation that determines k. This equation has complex roots k = 
sti, and it follows that the general solution is of the form 


x(t) = Acosat + Bsinat. (13.2) 


We can use the same trick we used in Section 9.4 to rewrite our solution to make 
it clear that it is one oscillation. If we write 


x(t) = VA2 + B2 


cos wt + (13.3) 


A B : 
SS ————. sin wt 
J A2 + B2 J A2 + B2 
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Fig. 13.1. A spring: the top picture shows the spring at its natural length /, unex- 
tended, while the lower picture shows the spring extended by a length x. 


Fig. 13.2. The solution x(t) = M cos(wt — ¢) plotted against t is shown here as 
a solid line. For comparison the dotted line shows the graph of x(t) = M cos at; 
the delay between the two solutions is 6/w, as indicated. 


then the squares of the coefficients of cos and sin within the [- - -] now sum to one, 
so we can find @ with 


A B 


cos ¢ = —————— and sin 6 = ————— 
J A2 + B2 J A2 + B2 
(i.e. with tang = B/A). Writing M = / A? + B? then (13.3) becomes 


x(t) = M[cos¢cos at + sing sin at] 
= Mcos(at — ¢), 


using the double angle formula cos acos b + sina sinb = cos(a — b). This solu- 
tion is illustrated in Figure 13.2. 
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Fig. 13.3. The simple pendulum. 


13.2 The simple pendulum 


We now consider the oscillations of a pendulum. Suppose that the pendulum has 
length L, that the mass of the shaft is negligible, and that the bob has mass m. 
Figure 13.3 shows the general setup; 0 is the angle that the pendulum makes with 
the downward vertical. 

The potential energy of the pendulum is mg x L(1 — cos@). If the pivot is at 
the origin then at an angle @ the coordinates of the centre of the mass m are x = 
(x, y) = (Lsin@, L(cos@ — 1)). Then x = (%, ») = (Lcos 06, —Lsin 06), and 
so the kinetic energy is 


sm|x|* = 4m(X* + y?) = 4m(L? sin* 667 + L? cos” 667) = 5mL76?. 
Since the total energy 
E = 5mL76? + mgL(1 — cos6) 
is constant, if we differentiate we obtain 
0 =mL766 + mgLsin66, 
which, on dividing by 6, yields the equation of motion! 


d*6 mg 
—= =-— sind. 13.4 
are L on) 
We will investigate equation (13.4) itself in Chapter 35, but for now we approx- 
imate it by a linear equation so that we can use the theory of the preceding chapter. 


! With some care, assuming the continuity of 0(¢) and its derivatives, it is possible to show that this equation is 
also valid when 6 = 0. 
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0 0.2 0.4 0.6 0.8 1 


Fig. 13.4. The left-hand picture is the graph of sin (solid) and @ (dashed) plot- 
ted against 0 for 0 < 6 < 1. Note that the curves are fairly close for 8 < 0.5 and 
almost indistinguishable when 6 < 0.3. The right-hand picture shows the pendu- 
lum at angles 6 = 0.3 and 6 = 0.5 to the downward vertical. Clearly requiring 
—0.3 < 6 < 0.3 is little restriction for a calculation involving the pendulum in a 
clock. 


When the oscillations are small, i.e. when @ is small, we can approximate sin @ by 
@ (see Figure 13.4) and so obtain the linear equation 


—_ = -@’6, (13.5) 


Apart from the change of dependent variable from x to 6, this is the same equa- 
tion as we had before in (13.1). The solution we found in the previous section 
remains valid here, and so we have 


6(t) = M cos(at — ¢). 


Thus the pendulum oscillates about 6 = 0, the downward vertical, with period 
27 /@ = 27,./L/g. Note that in this linear approximation the amplitude M of the 
motion (how wide the swing is) has no effect on the period. 


13.3 Damped oscillations 


The simple equations above neglect any effects of friction or air resistance, or what 
we might more generally call ‘damping’. 
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If we assume that damping exerts a force proportional to the velocity, but in the 
opposite direction, then, with jz > 0, 


mx + ux +kx = 0. 
Dividing by m as before, and setting A = «1/k, we arrive at the model 
¥+Ax+o°x =0 (13.6) 


with 4 > 0. 
In order to solve (13.6) we try x(t) = Ae and obtain the auxiliary equation 


k? +Ak +07 =0. 


The roots of this equation are 


ee earns 

= 5 : 

which gives rise to three possibilities depending on the nature of the roots, all 
illustrated in Figure 13.5. 


k 


Over—damping 


Critical damping 


Under—damping 


X OF 


o t 


Fig. 13.5. From top to bottom: over-damping x(t) = Ae~*!’ + Be~*2! (for large 
t, x(t) ~ Ae! (the dotted curve), where 0 < k, < ky); critical damping x(t) = 
Ate“ + Be (for large t, x(t) ~ Ate~*, shown in the dotted curve); under- 
damping. 
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Over-damping 
When 42 > 4a” there are two distinct real roots —k, and —k», and both are nega- 
tive (since 0 < A? — 4w* < 7). So the general solution is 


x(t) = Ae kit + Be, 


and all the solutions are exponentially decaying and approach zero as t > oo. 
Furthermore there are no oscillations of the system on its way to equilibrium. 


Critical damping 
When 47 = 4@” we have k = —A/2 ‘twice’, so that the general solution is a com- 
bination of e~*"/ and te7*"/2, 


x(t) =(A+ Bre. 


Again, the system settles down to its equilibrium without any oscillations. How- 
ever, this is the critical case, in that any further reduction in the damping allows for 
oscillations, as we will soon see. It is also possible for x(t) to increase for a short 
time interval (see Figure 13.5), see Exercise 13.9 for more details. 


Under-damping 


When A? < 4@” we have a complex conjugate pair for k, 
Me 
k=-~+t10 
2 


(with o = 5V4w? — 22). This gives the general solution 


—At/2 


x(t) =e (A cosot + Bsinot). (13.7) 


The system is always oscillating, but the amplitude of the oscillations decays to 
zero exponentially fast. 

If this solution represents the oscillations of a pendulum damped by frictional 
forces (air resistance, friction at the pivot) then although the amplitude decays the 
period of the oscillations remains constant. It is this effect that makes the pendulum 
an effective mechanism to run a clock. 

To see that this is the case, we rewrite x(t) from (13.7) in the compact form 


x(t) = Me? cos(ot — @), 


and differentiate with respect to ¢ to give 


dx 


A 
apr aM ore cos(ot — ¢) —-oMe“'/* sin(ot — o). 
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t 


Fig. 13.6. Even though the oscillations are decaying, the period of the oscillations 
remains constant. 


Thus maxima and minima occur when 


tan(ot — ¢) = er 


i.e. when 


ee 
ot —@= tan ae +nt 


for any integer n. The time between successive maxima is therefore 27r/o, inde- 


pendent of the amplitude, see Figure 13.6. 


Exercises 


13.1 A spring of natural length / and spring constant k is suspended vertically from a fixed 
point, and a weight of mass m attached. If the system is at rest (x = x = 0) how far 
has the spring extended? If the mass is pulled down slightly from this rest position and 


then released, show that it then oscillates about its equilibrium position with period 


2a /w, where w? = k/m. 


13.2 The acceleration due to gravity in fact depends on the distance R from the centre of 
the Earth: g = GM/R?, where M is the mass of the Earth and G Newton’s gravita- 
tional constant. Show that the period of oscillation of a pendulum will increase as it 


is taken higher. 


13.3 The Earth bulges at the equator; at a latitude 0, the distance to the centre of the Earth 


(measured in kilometres) is approximately 


R@) = {Rr cos? 6 + Rs sin 6, 


where Re = 6378 and Rp = 6357. 


13.4 


13.5 


13.6 
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Displaced volume 
V 


Fig. 13.7. The buoyancy force on an object is equal to the weight of water that it 
displaces. 


I decide to move from Leamington Spa, at a latitude of 52°, to Seville, which is lies 
at a latitude of 37°. My grandfather clock, which keeps perfect time, has a pendulum 
of length 75 cm. How long would the pendulum need to be to keep perfect time in 
Seville? 

The buoyancy force on an object is equal to the weight of water that it displaces. If 
an object has mass M and displaces a volume V of water then the forces on it are 
Mg — Vg, in units for which the density of water is 1; see Figure 13.7. 

A bird of mass m is sitting on a cylindrical buoy of density p, radius R, and height 
h, which is floating at rest. How much of the buoy lies below the surface? 

The bird flies away. Show that the buoy now bobs up and down, with the amount 
below the surface oscillating about ph with period 27./ph/g and amplitude m/mR?. 
An open tin can, half full of water, is floating in a canal. The can is 11 cm tall, has a 
diameter of 7.5 cm, and has a mass of 50 g. Show that at rest the can is submerged 
a distance of approximately 6.63 cm below the surface of the canal. If the can is 
pushed down further it will then perform oscillations about its equilibrium position. 
Show that the can bobs up and down every 0.21 seconds (a little under five times per 
second). The acceleration due to gravity is approximately 9.8 m/s? = 980 cm/s’; the 
density of water is 1 g/em?. You can check your answers in a sink with a baked bean 
can. 

A right circular cone, of height h, density p, and with base radius R, is placed point 
downward in a lake. Assuming that the apex remains point vertically downwards, 
show that if the cone is submerged to a depth x then 


(You need not solve this equation.) At equilibrium how far is the cone submerged? 
A dashpot is a device designed to add damping to a system, consisting essentially of 
a plunger in a cylinder of liquid or gas, see Figure 13.8. 
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— 


Fig. 13.8. A dashpot. Illustration © 2001 Airpot Corporation. Airpot is a regis- 
tered trademark of Airpot Corporation. 


Fig. 13.9. A mass-spring-dashpot system. 


It produces a resisting force proportional to the velocity, precisely the kind of 


‘damping’ that we used in our model 


mi + wx +kx =0, (E13.1) 


with jw indicating the ‘strength’ of the dashpot. Dashpots are used in a variety of 
applications, for example, cushioning the opening mechanism on a tape recorder, or 


in car shock absorbers. 


A mass-spring-dashpot system consists of a mass attached to a spring and a dash- 
pot, and is shown in Figure 13.9. A weight of mass 10 kg is attached to a spring with 
spring constant 5, and to a dashpot of strength jz. How strong should the dashpot be 
to ensure that the system is over-damped? What would the period of oscillations be 


if = 14? 


13.8 When first opened, the Millennium Bridge in London (see Figure 13.10) wobbled 
from side to side as people crossed; you can see this on video at www.arup. 
com/MillenniumBridge. Footfalls created small side-to-side movements of the 
bridge, which were then enhanced by the tendency of people to adjust their steps 
to compensate for the wobbling. With more than a critical number of pedestrians 


(around 160) the bridge began to wobble violently. 


Without any pedestrians, the displacement x of a representative point on the bridge 


away from its normal position would satisfy 


Mx +kx+Ax =0, 


2 A detailed analysis is given in P. Dallard et al., The London Millennium Footbridge, The Structural Engineer 


79 (2001), 17-33. 
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Fig. 13.10. The Millennium Bridge in London (courtesy of Arup). 


where 
M~4x10kg, k®5x10+kg/s, and A*~ 10’kg/s?. 


Show that the level of damping here is only around 1% of the critical level. 

The effective forcing from each pedestrian was found by experiment (which in- 
volved varying numbers of people walking across the bridge) to be proportional to x, 
with 


F ® 300x. 
If there are N pedestrians, the displacement of the bridge satisfies 
Mx +kx +Ax = 300NXx. 


Find the critical number No of pedestrians, such that if there are more than No pedes- 
trians the bridge is no longer damped. Show that if there are 200 pedestrians then 
there will be oscillations with a frequency of approximately 0.8 hertz (oscillations 
per second) the amplitude of oscillation of which grows as e!/8°, 

The problem was corrected by adding additional damping, a large part of which 
was essentially a collection of dashpots, in order to bring the damping up to 20% of 
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the critical level. What would this do to the value of k, and how many people can now 
walk across the bridge without counteracting all the damping? 

13.9 In the case of critical damping (see Section 13.3), the general solution of (13.6) is of 
the form 


x(t) = (A+ Bre”. 


Show that if AA < 2B then x(f) increases initially, reaching its maximum value at 


14 


Inhomogeneous second order linear equations 


We now investigate how to obtain solutions of the inhomogeneous equation 


ao 
fae: 40% ters = f(t). (14.1) 


As aconvenient shorthand we define an operator L, cf. (11.10), by 


dx: de 
L[x] =a +b + cx. 


Recall that L[x](t) is a function of f, as is x(t). Using this notation we can rewrite 
(14.1) more compactly as 


L[x] = f(). 


Since (14.1) is a second order equation, we expect that the general solution will 
have two arbitrary constants in order to fit any choice of initial condition x (fo) = 
xo and x(to) = yo. 


14.1 Complementary function and particular integral 


Because of the linearity of the equation, we can split the problem of finding its so- 
lution into two parts. First we try to find the general solution of the corresponding 
homogeneous problem 


L[y] = 0. 


We have already seen how to solve this problem in Chapter 12; its general solution 
will be a linear combination of two independent solutions y; (ft) and y2(t), 


y(t) = Ayi(t) + By2(t). 
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This is called the complementary function, and will provide us with the two arbi- 
trary constants that we require in our final result. 

The second part of solving the problem is to find one particular function xp(t) 
that satisfies the equation, i.e. for which 


L[xp] = f(t). 


The standard method for finding such a particular integral requires a mixture of 
experience and inspired guesswork, but we will see later (Chapter 18) that there is 
also a systematic way of solving this problem. 

Given our complementary function y(t) and a particular integral xp(t), the gen- 
eral solution of the original problem will be given by x(t) = y(t) + xp(¢), since 


Lily) +xp@)] = LlIYO] + Lap] =0+ FO = fo, 
using the linearity of L. Since y(t) = Ay, (¢) + By2(t), our solution 
x(t) = Ay (t) + By2(t) + xp) 


contains the two arbitrary constants that we need in order to fit any choice of initial 
conditions. 

There is nothing special about applying this technique to second order equations. 
The essential point is the linearity, and it is possible to solve first order equations 
this way, as we now see in the following simple example. 


Example 14.1 Use the technique of the complementary function and particular 
integral to solve the first order equation 


ae _ 
rr px = q. 


The ‘complementary function’ is the general solution of the homogeneous prob- 
lem 


y+ py =0, 
which is y(t) = Ae”. A particular integral is any solution Xp(t) that solves 
Xp + PXp =; 


one such solution is xp(t) = q/p. So the general solution is given by the comple- 
mentary function plus the particular integral, 


x(t) = Ae?! + Z 
Pp 


as we found before in (9.5). 
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We now look at certain choices of right-hand side f(t) for which we can find a 
“particular integral’ for 


dx dx 
aan eer +cx = f(t), (14.2) 
i.e. one particular solution xp (7) that when plugged into the left-hand side of (14.2) 
gives the correct function f(t) on the right-hand side. 

We can do this when f(t) is a combination of powers of ft, exponentials (like 
e*'), and sines and cosines. Essentially we try a solution x(t) that looks very much 
like the original f(t), with some adjustments when f(f) is a part of the comple- 
mentary function. This is known as the ‘method of undetermined coefficients’; 
essentially we guess the form of the solution, and include some coefficients in our 
guess, which are then determined by substituting into the equation. 


14.2 When / (ft) is a polynomial 


When f(t) is a polynomial in ¢ then our ‘guess’ for the particular integral xp(t) is 
a general polynomial of the same order as f,, i.e. if the highest power in f(f) is t” 
then our guess is 


xp(t) = ent” + cp-it” | +--+ +0. 


However, we will see that if x(t) = c satisfies the homogeneous equation then we 
have to multiply our guess by t. These ideas are most clearly illustrated by finding 
the particular integral for a number of examples. 


Example 14.2 Find the general solution of the equation 
re 6x = 12, 
We have already found the complementary function, i.e. the solution of 
yty—6y =0, 


which was y(t) = Ae~ + Be~*", see (12.3). Since the right-hand side is a con- 
stant, we try xp(t) = C. All the derivative terms in the differential equation vanish 
and we are left with 


—6C = 12, 


so we need C = —2: the particular integral is x)(¢) = —2. The general solution is 
made up of the complementary function plus the particular integral, 


x(t) = Ae~ + Be~* — 2. 
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Example 14.3 Find the general solution of the equation 
X +x — 6x = 36t. 


In line with our policy above we try xp)(t) = Ct + D for a particular integral. 
(Note that if we were to try xp(t) = Cr without the constant term then although 
we can choose C to give 36t (C = —6) the xp term would give an extra factor 
of —6.) Then we have x, = C, and so we require 


C — 6(Ct + D) = 36t, 


which gives C = —6 and D = —1. So the particular integral is xp)(¢) = —6r — 1 
and the general solution is 


Ae” + Be* — 6t — 1. 
Example 14.4 Find the general solution of 
¥ +4 —6x = 2169° 
For the particular integral we have to try xp(t) = Ct? + Dt? + Et + F. Then 
Xp=3Cr?+2Dt+E and = yp =6Cr+2D, 
and so we need 


6Ct +2D+3Cr? + 2Dt + E —6(Ct? + Dt? + Et + F) = 216°, 


Xp Xp Xp 


—6Ct? + BC — 6D)t* + (6C +2D — 6E)t + 2D + E — 6F) = 216°. 
We want 
—~6C =216, 3C—-6D=0, 6C+2D—6E=0, and 2D+E-—6F =0, 


which yields C = —36, D = —18, E = —42, F = —13, and the particular integral 
is 


p(t) = —36t? — 184? — 42¢ — 13. 


Thus the general solution is 


x(t) = Ae + Be~* — 3613 — 18¢7 — 421 — 13. 


We now see that there is a possible catch with this technique. 
Example 14.5 Find the general solution of 
xX —2x =4. 
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We saw earlier, in (12.4), that the complementary function is A + Be’. If we try 
Xp(t) = C for our particular integral then x, — 2xp will be zero, since we have a 
constant as part of the complementary function. To deal with this we multiply our 
‘guess’ by an extra factor of f, as we did to obtain the second independent solution 
of the homogeneous equation when the auxiliary equation had a repeated root. 

Thus instead of xp(t) = C we try xp(t) = Ct and then we have x, = C and 
Xp = 0, therefore we want 


—2C =4 


which gives a particular integral when xp(t) = —2tr, and so the general solution is 


x(t) = A+ Be™ — 21. 


Example 14.6 Find the general solution of the equation 
x=4, 


This is an extreme example, and it would be easy to solve by integrating twice. 
However, we can also apply our general method for linear equations with constant 
coefficients. The general solution of the homogeneous equation y = 0 is y(t) = 
At + B (there is a repeated root k = 0 of the auxiliary equation), so we cannot 
try Xp(t) = C, and nor can we try xp(t) = Ct; we have to try xp(¢t) = Cr? in this 
case. Since Xp = 2C we need C = 4/2 = 2 and the particular integral is 217. The 
general solution is therefore x(t) = 207+ At+B. 


14.3 When / (t) is an exponential 


The second kind of right-hand side for which we can find a particular integral is 
an exponential, f(t) = ce’. Essentially we try a multiple of the same exponential 
for our particular integral, xp(t) = C e“. However, if the exponential e“ on the 
right-hand side is a solution of the homogeneous equation then Ce“ cannot be 
a particular integral, since substituting this into the left-hand side will just give 
zero (L[Ce*"] = 0). So in this case we need to try ¢ times the exponential, xp)(t) = 
Cte“. In the extreme case when k is a repeated real root of the auxiliary equation 
we have to try xp(t) = Cre. 
We look at the simplest case first. 


Example 14.7 Find the general solution of 


¥+x—6x =4e 7", 
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We saw above (in Example 12.1) that the complementary function, i.e. the 
general solution of the homogeneous equation ¥ + y —6y = 0, is y(t) = Ae~ 
+ Be~*’, and so e~~! is not a solution of the homogeneous problem. This means 
that we can try xp(t) = C e~*! as a particular integral. Because 


Xp=—2Ce~ and ip =4Ce” 
we need 
4C —2C —6C =4, 
i.e. C = —1, so the particular integral is 


~2 
xp(t) = -e 


and the general solution is 


x(t) = Ae + Bew*! — e7 


Example 14.8 Find the general solution of 
¥+%—-6x =5e™. (14.3) 


The complementary function is the same as in the previous example, y(t) = 
Ae~ + Be~’. Because e~*’ is a solution of the homogeneous equation we have 
to try xp(t) = C te~*! for a particular integral for this example. With this guess for 
Xp(t) we have 


xp=Ce"%+—-3Cte* and = xX = —6Ce™’ + 9Cte™, 
and so we need 
—6Ce~** + 9Cte~* + Ce~*! — 3Cte*! — 6Cte** = 5e7**, 


The te~*” terms cancel, and so we want C = —1. The particular integral turns out 
to be 


—3t 
Xp(t) = —te 


giving the general solution 


x(t) = Ae” + Be~* —te~*". 


Example 14.9 Find the general solution of 
X+2x +x =6e%. 
The complementary function y(f) is the solution of 


P+ 2+y =0. 
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Trying y(t) = e* yields the auxiliary equation k? + 2k + 1 = 0, and so k = —1 
‘twice’. It follows that y(t) = Ae~‘ + Bte~’, which means that both our ‘stan- 
dard guess’ xp(t) = C e ‘ and our adjusted guess Xp(t) =C te solve the homo- 
geneous equation. Because of this we have to try xp(t) = C t7e~'; once again we 
deal with the problem by including an extra factor of t. We have 


Xp = Iie = Cre and Xp = Ie aCe Cie. 
Substituting into the left-hand side we get 
[2Ce? —4Cre + Cte] + 2[2Cte! — Ct’e"] + Cre = 2Ce™. 


So we need C = 3; the particular integral is xp(t) = 3t7e~' and the general solu- 
tion is 


x(t) = 3t7 + At + Bye. 


14.4 When f(t) is a sine or cosine 


If f(t) =asinot + Bcosot (including the cases a = 0 or B = 0) then we need 
to try a combination of sinot and cos ot for the particular integral, 


Xp(t) = C sinot + Dcosot. (14.4) 


Note that we need a combination of sine and cosine even if f(t) only involves 
one of these two functions, since trying xp(t) = C sinot means that x,(t) = 
Co cosot. If sinot and cos ot satisfy the homogeneous equation then instead we 
need to try 


Xp(t) = Ctsinot + Dtcosot (14.5) 
with an extra factor of ft. 
Example 14.10 Find the general solution of 
xX +2x +x = 100cos2t. 


We found the complementary function y(t) = (A + Br)e~‘ in Example 14.9. 
Since sin 2¢ is not a part of the complementary function, we can try 


Xp(t) = C sin 2t + Dcos 2t 
as a potential particular integral, cf. (14.4). Then 


Xp(t) = 2C cos 2t — 2D sin 2t and Xp(t) = —4C sin 2t — 4D cos 2t, 
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and so 
Xp + 2Xp + Xp = (4C — 3D) cos 2t — (3C + 4D) sin 2t. 
Therefore we need 
4C —3D = 100 and 3C +4D =0, 
which gives C = 16 and D = —12. Thus the particular integral is 
Xp(t) = 16sin2t — 12 cos 2t 


and the general solution is 


x(t) = (A+ Brje ' + 16sin2t — 12 cos 21. 


Example 14.11 Find the general solution of 
X+x = 8cost. (14.6) 


This example is not quite so simple, since sin t and cost are the solutions of the 
homogeneous problem (the complementary function is A sint + Bcost). Using 
the same remedy as that for Example 14.5, namely multiplying our original guess 
by t, we now try a combination of ¢ sint and ¢ cost, 


Xp(t) = Ct sint + Dt cost, 
cf. (14.5). Then we have 


Xp(t) = C sint + Ctcost + Dcost — Dt sint 
Xp(t) = 2C cost — Crsint — 2D sint — Dt cost, 


so that 
Xp + Xp = 2C cost — 2D sint. 
We need C = 8/2 = 4 and D = 0, giving the particular integral 
Xp(t) = 4t sint. 
The general solution is therefore 
x(t) = Asint+ Bcost+4tsint. 


Note that although the complementary function has a fixed amplitude, the ampli- 
tude of the oscillations produced by having the 4f sint term on the right-hand side 
grows linearly in ¢. This is the phenomenon of resonance, and is discussed in more 
detail in the next chapter. 
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14.5 Rule of thumb 


As atule of thumb for finding particular integrals for second order equations: 


¢ The ‘standard guess’ is a general version of what you are aiming for, e.g. given an nth 
order polynomial on the right-hand side, try a general nth order polynomial; but 

¢ If the standard guess contains terms that satisfy the homogeneous equation, multiply by 
t, repeating this step until the guess no longer contains any terms that solve the homoge- 
neous equation. 


14.6 More complicated functions f (1) 


Similar methods will also work for more complicated choices of f(t) that are 
products and sums of those that we have already considered. To find a particular 
integral xp(t) for the equation 


L[x] = afi (t) + Bho) (14.7) 


(from the definition of L this is simply ax + bx + cx = af) (t) + Bfa(t)) we can 
use the linearity of L to reduce this to finding particular integrals x; and x2 satis- 


fying 
L[xil= fi) and L[x2] = fo(t). 
A particular integral for (14.7) is then xp(t) = wx1(t) + Bxa(t), since 
Ll xp] = L[ox, + Bx2] = aL[x1] + BL[x2] = afi (t) + Bfro(e). 


We can also find particular integrals for products; to do this we can try the 
product of the guesses for each individual factor. For example, to find a particular 
integral for 


X +x —6x = te‘ cos2t 
we would try 
Xp(t) = (At + B)Ce'(Dcos2t + E sin2r). 


Multiplying this out and simplifying the arbitrary constants, we try a particular 
integral in the form 


Xp(t) = (At + B)e~' cos2t + (Ct + Dye“ sin 2r. 
For this choice of xp(¢), 


kp(t) = ((—A + 2C)t + (A — B + 2D))e cos 2t 
+((-2A —C)t+(-2B + C — D))e“ sin2t 
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and 
Xp(t) = ((—3A — 4C)t + (-2A — 3B +4C —4D))e~“ cos 2t 
+((4A — 3C)t + (-44 + 4B —2C —3D))e™ sin 2t 
Substituting in gives 
Xp + Xp — Oxp = ((—10A — 2C)t + (-A — 10B + 4€ — 2D))e‘ cos 2t 
+((2A — 10C)t + (-4A + 2B — C — 10D))e™ sin 2t 
Solving for A, B, C and D to ensure that this equals te~‘ cos 2t gives 


(—8 — 130r)e~! cos 2t + (53 — 26t)e sin 2t 
1352 : 


Xp(t) _ 


Exercises 


14.1 Find the general solution to the following differential equations (the homogeneous 
parts of the equations are all treated in Exercise 12.1) In part (7) also find the one 


solution that has x(0) = n and x(0) = 0. 
(i) ¥-—4x = 72, 
(ii) ¥ — 4% = 2?, 
(iii) ¥ + % — 2x = 3e%, 
(iv) +4 —2x =e’, 
(v) X+2x+x=e7%, 
(vi) fora 4 w:¥ + wx = sinat, 
(vii) fora = w:% + ox = sinat, 
(viii) ¥ + 2x + 10x =e’, 
(ix) X + 2x + 10x = e' cos 31, 
(x) ¥ + 6% + 10x = e* cost, and 
(xi) ¥ + 4% + 4x =e”. 
14.2 Find a particular integral for 


¥ +x —2x = 12e7 — 6e’. 


(You might find parts (iii) and (iv) of the previous exercise useful.) 
14.3 If you are feeling strong, find a particular integral for 


X + 4x = 289te’ sin 2r. 


15 


Resonance 


We now consider in more detail the resonance phenomenon mentioned briefly in 
the previous chapter while we were dealing with Example 14.11. 


15.1 Periodic forcing 


Suppose that x(t) denotes the distance of some system from its equilibrium posi- 
tion, and that without any external forcing the system would oscillate about this 
position, with x(t) satisfying 


x= ox. (15.1) 
We know from Chapter 13 that this equation has the general solution 
x(t) = Acosat + Bcoswt = M cos(at — ¢), (15.2) 


where in the more compact form M = JA? + B2 and ¢ = tan7!(B/A). We refer 
to 27/w as the ‘natural frequency’ of the system; it is how the system ‘likes’ to 
oscillate if left to itself. 

We now consider what happens if we apply an external forcing to the system 
that is also oscillating, 


¥ + 07x =acosat. (15.3) 


Note that the forcing function on the right-hand side has amplitude a and oscillates 
with frequency 27/a, cf. Figure 9.1. 

Such equations arise in many physical situations, and we now provide a physi- 
cal motivation by considering a simple model of the spinning drum of a washing 
machine. We suppose that the cylindrical drum has mass M, and that it is loaded 
unevenly with clothes of mass m, such that the centre of mass of the clothes lies 
a distance r from the axle. In order to control the oscillations of the drum, it 
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mass m 


f a 
rest position 6 
t— x 


x 


Fig. 15.1. A simple-minded model of a washing machine. 


rotates within another cylinder that is attached to the sides of the housing with 
large springs, each with spring constant k/2, see Figure 15.1. 

For simplicity we will assume that the drum can only move from side-to-side, 
and we denote by x the horizontal displacement of the axle from its rest position. 
The horizontal coordinate of the centre of mass of the drum and clothes is given 
by 


mr cos@ + Mx 
m+M , 


Assuming that the drum spins with constant angular velocity 6 = a, so that @(t) = 
at, this gives 


mr cosat + Mx 


aE m+M 


The springs provide a restoring force of magnitude kx for some constant k, and so 
Newton’s second law, 


x 
(M + Mn) a = —kx, 


gives 


nae = + =| ee 


m+M 
Simplifying this equation we obtain 


Mx+kx= mr? cos at. 
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With no load (m = 0) or with a perfectly balanced load (r = 0) the displacement 
from the rest position satisfies 


¥+o*x= 0, 
where we have defined w* = k/M. With the off-centred load the equation is 
¥+o’x =a cos at, 


precisely (15.3), where the forcing term on the right-hand side has amplitude a = 
mra?/M. 


15.1.1 No resonance: bounded response 


First we consider what happens when the system is forced at a frequency that dif- 
fers from its own natural frequency, i.e. when a 4 w. The complementary function 
(given in (15.2)) is y(t) = Acos wt + B sin wt, and so we can try xp(t) = C cos at 
for the particular integral (there is no need for the sin at term since there is no x in 
the equation). Substituting in we get 


—Ca’ cos at + w°C cos at = acos at, 


and so we want 


The general solution is therefore 
x(t) = Mcos(wt — $) + =——5 cosa, (15.4) 
o-—a 


and we can see that the motion of x(t) combines oscillations at two frequencies: its 
‘natural frequency’ w, and the forcing frequency a. However, notice that as a gets 
closer to w, the amplitude of the second term increases (although it is bounded for 
each fixed choice of @). 


15.1.2 Ideal’ resonance: unbounded response 


When @ = our usual ‘guess’ (xp(t) = C sin wt + D cos wt) solves the homoge- 
neous equation, and therefore we now have to try xp(t) = Ct sinwt + Dt cos at 
as the particular integral. For this we have 


Xp = Csinwt + Cat coswt + Dcoswt — Dw sinwt 
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Fig. 15.2. A graph of the function x(t) = ¢ sin 2t. The amplitude of the response 
to a forcing at an object’s natural frequency grows linearly in time. 


and so 
Xp = 2Ca@cos at — 2Dwsin wt — wo [Ct sin wt + Dt cos wt]. 
Therefore 
Xp + Wo Xp = 2Cwcoswt —2Dwasinat; 


since we require the right-hand side to equal a cos wt, for our particular integral 
we need 


Ged 
Xp(t) = —t sinat. 
i Baal 


You can see from a graph of xp(¢) in Figure 15.2 that the amplitude of the result- 
ing oscillations grows linearly. After a time this will, of course, become the main 
component of the solution 


: a : 
x(t) = Csinwt + Dcosat + a sin wt, 
7) 


since the first two terms represent an oscillation of fixed amplitude. 

Forcing a structure at its natural frequency can have disastrous consequences; 
on 14 April 1831, the Broughton suspension bridge over the River Irwell collapsed 
when the 60th rifle corps marched over it in step, thereby forcing it at one of its 
natural frequencies. Figure 15.3 shows the report from The Times published on 
15 April 1831. Armies on the march now break step over bridges to prevent this 
occurring (see Figure 15.4). 


15.2 Pseudo resonance in physical systems 145 


hye ANT Regan IRENE NAEAS BASES FUR UERL Yi NESE RE URE BEN OSD SE) 
Fas. or Brovanton Suspension-Bripet.—Y es- 

-terday a most frightful accident occurred to « party of the 
0th Rifles, consisting of abeut 70 officers and men, who were 
returning trom exercising on Kersal-moor, by the Broughten- 
road, to their quarters in Regent.road barracks. About-a 
| quarter-past 12 at nvon the party reached the Chain-bridge, at 
Sroghin, and commenced filing across it. Scarcely bad 
ithe leading file, preceded by three of their officers, touched 
terra firma on the Pendleton side, when a tremendous noise, 
resembling ati irregular volley of musketry, arrested the at- 
tention of the party. In an instant after, the bridge separated 
at the end next the tell-house, and, witha large quantity of. 
ry stone pier was precipitated into the river. The situation 
of the soldiers at this moment was most fright{ul—about 40 of 
ithe party, encumbered with their arms and accoutrements 
lwere entangled with the railing of the bridge, or overwhelmed 
in the-river. Most providentially the river was low, and thus 
ino lives were lost. ut we are concerned to state that 6 are] 
‘now in the hospital with fractured limbs, some of whom, it is 
feyred, will be disabled for life; and 14 have been injured more 


pr jess by cuts, contusions, &¢ — Manchester Herald. 


In sancennened of the scarcity nf orase. fleah meat | ' 


Fig. 15.3. The Times, 15 April 1831 ‘FALL OF BROUGHTON SUSPENSION-BRIDGE’. 


15.2 Pseudo resonance in physical systems 


There is almost invariably some damping in physical systems, and when there is 
damping you will not see this ‘ideal’ resonance (meaning linear growth of the 
amplitude). However, there will still be a frequency at which the amplitude of the 
resulting oscillations is significantly larger than the amplitude of the forcing. 

We will consider the equation 


¥ 41% + 07x =acosat 
where the damping coefficient A is strictly positive but not too large (we will soon 
be precise about what is ‘not too large’). In Chapter 13 we discussed the homo- 
geneous equation (j) + Ay + w?y = 0) in detail, and for A* < 4@* we found the 
complementary function 


y(t) = era cosot+ Bsinot), 


where o = 5V 402 — 42. Since 4 > 0 this oscillates and decays to zero exponen- 
tially (in the language of Chapter 13 the system is ‘under-damped’). 
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| 


Fig. 15.4. A sign on the Albert Bridge in London, instructing troops to break step 
to prevent destructive resonance effects. (Courtesy of the National Information 
Service for Earthquake Engineering, University of California, Berkeley.) 


The particular integral will be some combination of sin at and cos at (these are 
not part of the complementary function), 
Xp(t) = C sinat + Dcosat. 
Substituting this guess into the equation, we want 


—Ca’ sinat — Da? cosat + A[Cacosat — Da sinat] + 


w[C sinat + Dcosat] = acosat. 
Collecting coefficients of sin at gives 
C(w — a’) = Dha, 
while by equating the coefficients of cos at we obtain 
D(w* — a’) =a-—Cha. 
Solving these simultaneous equations gives 


CH aha rr ee a(w* — a) 
— (w2 = a2)? a (Aa)? = (aw = a2)? =e (Aa)2’ 


15.2 Pseudo resonance in physical systems 147 


and so the particular integral is 


aha ; a(w* — a”) 
Xp(t) = Gaeta + (Gara (15.5) 
We have already seen (in Section 9.4.3) that we can combine the two terms in 
an expression like A cosat + B sinat to give one oscillating term; in particular 
the amplitude of the resulting oscillation is given by the square root of the sum of 
the squares of the two coefficients, V A? + B2. So the amplitude of the oscillations 
that arise in response to the forcing a cos at is 


2 9 4 2\2 
ne =a (Aa)* + (@* — a*) 


[(@? — a7)? + (Aa)? 
a 


© V2 a) + Gay? 

The ratio of the amplitude of the response to that of the forcing is therefore 
1 

V(@? = 0)? + (ha)? 


which depends on the frequency 271/a of the forcing. Graphs of F(a) for various 
values of A when w = | are shown in Figure 15.5. Note that the maximum value 


F(a) = 


(15.6) 


Fig. 15.5. Pseudo resonance when w = 1: graphs of F(a) for A = 1, 1/2, 1/4, 
1/8, 1/16 and 0, increasing as A decreases. 
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of F(a) occurs near a = | (i.e. where w = w), and increases as the damping level 
d decreases. 

The maximum value of the response occurs when the denominator in F'(q@) is a 
minimum, so when 


d 
—[(w* — a*)? + (Aa)?] = 0, 
da 

i.e. when 


—da(w? — a?) +2470 =0, 


* = w — (1/2)A’. It is therefore clear that as the damping becomes 


which gives @ 
ever smaller the value of a for which the response is maximum becomes increas- 
ingly close toa =o. 


The maximum value of the amplitude is 
2a 
AV 4a? — 22° 


and of course increases in magnitude as 4 decreases. As A > 0, Rmax ~ a(Aw)~!. 


Rimax (A) = 


Exercises 
15.1 For a 4 @ show that the solution of the equation 
¥ + wx = cosat (E15.1) 
with x(0) = x(0) = Ois 


x(t) = 


5 5 (cos at — cos at). (E15.2) 
w-—a 


15.2 Use the double angle formulae 


cos(@ + @) = cos@ cos @ + sind sing 


to find an expression for cos x — cos y as a product of two sine functions, and hence 
rewrite the solution in (E15.2) as 


2 _ (@+a)t . (w-—a)yt 
2 sin 5 sin Se 
If a is close to m then |~ + @| is much larger than w — a; one of the two terms 

oscillates much faster than the other. A graph of such an expression when w = | and 
a = 0.8 is shown in Figure 15.6. The periodic variation of the amplitude of the basic 
oscillation is known as beating. You can hear this when, for example, two flutes play 
slightly out of tune with each other. 

15.3 When a = @ show that the solution of (E15.1) with x(0) = x(0) = 0 is x(t) = 
t sinwt/2w. Recover this solution from that for a ~ w by letting a > w in (E15.2) 
and using L’ H6pital’s rule. 


w@2 
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1 
100 150 


Fig. 15.6. The phenomenon of beats: the graph of sin0.9¢ sin0.1¢ against t (the 
dashed line shows how the amplitude of the faster oscillation varies like sin 0. 1f). 


L’H6pital’s rule: if f(x) > Oas x — a, and g(x) > 0 as x — a then 


_ Fh) 4 fe) _ f'@ 
lim —— = lim = 

x>a g(x) xa g!(x) g'(a) 
(provided that f and g have continuous derivatives at x = a). 


15.4 A model for the vibrations of a wine glass is 


X+AX+ 07x =0, 


where A and w are constants. Suppose that when struck the glass vibrates at 660 Hz 
(about the second E above middle C on a piano). Show that 


V 4a — A? = 26407. 


If it takes about 3 seconds for the sound to die away, and this happens when the 
original vibrations have reduced to 1/100 of their initial level, show that 
2 log 1 
if ee og By: 
3 

and hence that A = 3.07 and w = 4.15 x 10° (both to three significant figures). 

The glass can stand deforming only to x * 1. A pure tone at 660 Hz is produced 
at D decibels and aimed at the glass, forcing it at its natural frequency, so that the 
vibrations are now modelled by 


19(2/10)—8 
tA + ox = —z— 08132071. (E15.3) 


How loud should the sound be, i.e. how large should D be, in order to shatter the 
glass? (Decibels are on a logarithmic scale, hence the exponential on the right-hand 
side of (E15.3). The strange factor in front of the forcing produces roughly the correct 
volume level.) 
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Higher order linear equations with constant coefficients 


The methods that we have developed to treat second order linear equations extend 
in a straightforward manner to treat higher order linear equations. 
A general nth order linear ODE with constant coefficients can be written in the 
form 
d’x dt hy 


an + ay 


dx 
aes Heeb aie + aox ft) (16.1) 


-l drr-1 


where we assume that a, 4 0, cf. (3.5). 
The initial value problem in which x (f) and its first n — 1 derivatives are speci- 
fied has a unique solution. 


Theorem 16.1 Given an initial condition 


n-1 


* (to) =%n-1 (16.2) 


XG) =%0, Xo)=*1, FG) --, Foy 


the linear equation (16.1) has a unique solution defined for allt € R. 


Note that in the statement of the theorem it is claimed that the solution exists for 
all t € R. This is because for such a linear equation it is possible to guarantee that 
the solution does not blow up in finite time (cf. Section 6.3 and Exercise 12.3). 


16.1 Complementary function and particular integral 


In order to find an explicit solution of an nth order linear problem we proceed as 
we did for second order problems. The only significant difference is that now we 
need n linearly independent solutions of the homogeneous equation in order to 
be able to satisfy the n initial conditions in (16.2). See Definition 11.2 for linear 
independence, and also Section 16.2. 
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First we find the complementary function by solving the homogeneous equation 
d’y qv! 


me : 
On aan tn gaat +17 + aoy = 0: 


we try y(t) =e“. This will give an nth order algebraic equation for k, 
dank” + an_1k"! +--+» +a9 =0. 


For n < 4 there are methods for finding the roots of such equations,! although for 
higher order equations we would have to use numerical methods. 

For each non-repeated real root k of the equation we have a solution e*’; for a 
real root k that is repeated m times we have m linearly independent solutions, 


kt 


ev te’, Mee and pm—lokt. 


for each non-repeated complex conjugate pair p + iw we have the two solutions 
e?! sin wt and e?' cos wt, while for a complex conjugate pair p + iw that is repeated 
m times we would have the 2m solutions 


e”' cost, ce?’ sinwt, te?’ coswt, te?’ sinwt, ..., 


prt prt 


e”' cos ot, e” sinat. 


Finding a particular integral involves the same style ‘guesswork’ as in the sec- 
ond order case. 


Example 16.2 Find the general solution of 


d*x : 
— — 16x = 64sin2t. 
dt* 


! Fora general cubic equation 
x? — bx? +ex —d=0 
first substitute x = y + b/3 which gives 
etmx=n with m=c—b?/3 and n=d —be/3 + 2b9/27. 


Tartaglia developed a method of solution for this form of cubic (first published by Cardan in 1545); notice that 


(a—b)> + 3ab(a — b) =a? — 
and so if a and b satisfy 

3ab =m and a—b=n 
then a — b solves x3 + mx =n. Since b = m/3a the second of these equations is 

3 
Sieh oe 
27a3 

which gives a® — na? — m3 /27 = 0, a quadratic equation for a>. We can find the roots of this equation using 
the quadratic formula, from which we get a by taking cube roots. The value of b is given by b = m/3a, and 
then a — b is a solution of the original equation. 
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To find the complementary function, i.e. the general solution of 


dty 


drt 
we try y(t) = e* and so we need k* = 16. The solutions of this are k = +2 and 
k = +21, and so the complementary function is 


— loy = 0, 


y(t) = Ae~ + Be~” + Ccos2t + Dsin2t. 


To find a particular integral we cannot try a combination of sin 2¢ and cos 2t since 
they solve the homogeneous equation, so we have to try a combination of f sin 2f 
and f cos 2t. If 


Xp(t) = Et sin 2t + Ft cos 2t 


then after some algebra 
4 
a 


and so substituting in we want 


= E[16t sin 2t — 32cos 2t] + F[32 sin 2t + 16t cos 2r], 


—32E cos 2t + 32F sin2t = 64sin2t, 


ie. E =O and F = 2, giving a particular integral xp)(t) = 2t cos 2t and hence the 
general solution 


x(t) = Ae! + Be! + (C + 2t)cos2t+ Dsin2t. 


16.2 *The general theory for nth order equations 
In Chapter 11 we developed a general theory for second order linear homogeneous 
equations 
2s. 
dr? 
showing that given two linearly ce solutions x; (¢) and x2(t) of this equa- 
tion we can form the general solution as a linear combination 


a pine ~ + p2(t)x = 0, (16.3) 


c1x1(t) + c2x2(t). 


Realising that the collection of all solutions of (16.3) is the same as the kernel of 
the linear operator L : C 2(1) > C°(1) defined by 


L[x] =x + pi(t)x + pott)x, 


we could re-express our results as “dim ker(L) = 2’. 
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It is possible to generalise the results of Chapter 11 to treat the linear nth order 
equation 
ny n— ly 
oe TPO et poh ae oe + Palt)x = 0; (16.4) 


in particular, similar arguments show that the general solution of (16.4) can 


be formed from linear combinations of any n linearly independent solutions 
x1(t),..., Xn (t) of (16.4), 


x(t) = cy x(t) +--+ + yx) (t). 


Denoting by C”(/) the collection of all functions defined on n which have n con- 
tinuous derivatives, we can define a linear operator L : C"(I) > C OG by 


n—1 


d’x dd’ "x 
ah + PAG) at a eae ap or Woe ~ + Pu(t)x 


(we now need x € C”(/) to make sure that the term d”x /dt” is sensible). We can 


then express these results by saying that ‘dim ker(L) =n’. 
Checking whether two functions are linearly independent is straightforward, 


L[x] = 


since we saw that they are linearly independent if and only if they are not pro- 
portional. When there are three or more functions involved we have to make 
more systematic use of the Wronskian, which we discussed only in passing in 
Chapter 11. Given n functions f)(t),..., fy (t), we define their Wronskian to be 
the matrix determinant 


filt) se fn(t) 
WIft,---> frl@ = "he 
Co Fi fdr OW). 635. AE rw) 


Note that for n = 2 this reduces to W[ fi, fol@) = fi fo — fof, as used in 
Chapter 11. As for the case of two functions, it is possible to show that n solu- 
tions of the linear equation (16.4) are linearly independent on an interval J if and 
only if their Wronskian is never zero on J. Exercises 16.2—16.4 lead you through 
some of this theory in the case of three functions f;, fo and f3. 
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16.1 Find the general solution of the following equations: 


(i) 
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16.2 


16.3 


16.4 
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(ii) 
y” — 3y’ +2 =sinx, 
(iii) 
d*x dx 4 gt Pisa a = 
= sin 
dé de de od 
(if x = e*' one solution of the corresponding quartic equation is k = 1 + i), and 
(iv) 
d*+x dx , 
ar = aa? + 4x =e. 


The linear independence of three functions /|, fo and /3 on an interval J depends on 
the number of solutions of the equation 


afi) tarfr(t)+a3fs(t)=0 forall rel. 


By differentiating this equation once, and then once more, show that a1, a2 and a3 
satisfy the matrix equation 


fi fr ai) oy 0 
dfi/dt dfr/dt = df3/dt a2 }=]0 
d fi/dt? d?fo/dt? dd? f3/dt? 7 \a3 0 


Deduce that if W[f1, f2, f3](t), the Wronskian of f,, f2 and f3, defined as 


fi io BB 
WIS, fo, BIO =| dfi/dt dfo/dt dfs/dt |, 
d? fi/dt? d? fo/dt? d? f3/dt? 


is non-zero for any ¢ € J then f|, f2 and f3 are linearly independent. 
Show that any three solutions of a third order linear differential equation are linearly 
independent on an interval Jif and only if their Wronskian is non-zero on J. 
Suppose that x1, x2 and x3 are three solutions of the third order linear equation 
dx d? 
d 


x dx 
ae + PO Ts + IMs, +r(t)x =0, (E16.1) 


all defined on some interval J. 
We now show that, just as for two solutions of a second order linear equation, 
dw 
— =—p(t)W (E16.2) 
dt 
(cf. Exercise 11.2). You will need various properties of determinants, which you can 
prove by longhand (if you wish) in the next exercise. 
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(i) By differentiating the determinant form of the Wronskian, show that 


xX] X2 X3 
W= x1 x0 x3 : 
d3x,/dt? d3x2/dt? d3x3/dt? 
(You will need parts (i) and (ii) of the next exercise.) 
(ii) Substitute in for d>x j jdt? using the differential equation (E16.1), and hence 
show that 


X} X23 
W=-p@)|i1 in ds], 
X1 XQ X3 
i.e. that (E16.2) holds. (You will need parts (ii) and (iii) of the next exercise.) 
(iii) Solve equation (E16.2) to find an expression for W (f) involving an integral, and 
deduce that either W(t) = 0 for all t € J, or that W(t) 4 0 for allt € J. 
16.5 For the previous question you will need the following properties of determinants: you 
should be able to prove them in the 3 x 3 case treated here by simple (if laborious) 
calculation, using the explicit expression for the determinant of a3 x 3 matrix 


abe 
s t|=a(sz—ty) -—b(rz—tx)+c(ry — sx). 
x y Z 
(i) 
a |¢ bec abé abe abe 
=| Gia ee ll 
y 2 xX y Z He Yo a x y Z 


(i.e. differentiate one row at a time; this is essentially the product rule), 
(ii) If any two rows are proportional then the determinant is zero. Check this for 


a b Cc 


ha Ab rhre| = 0, 
x y 2Z 
(iii) Determinants depend linearly on their rows. Show this for the case 
a b c 
r S t 
ax; + Bx2 ayit+ By. 21 + B22 


a bee a boc 
=alr s tl|+ flr s t 


Part III 


Linear second order equations with 
variable coefficients 


17 


Reduction of order 


In Chapter 11 we discussed the general theory of second order linear equations. In 
the intervening chapters we have concentrated on linear equations with constant 
coefficients, but we now return to the more general case in which the coefficients 
are allowed to be functions of f, 
d°x dx 

alta + ate eg +c(t)x =0. (17.1) 
We saw in Chapter 11 that in order fully to solve a second order homogeneous lin- 
ear differential equation we need two linearly independent solutions. In this chap- 
ter we show that if we happen to know, or can guess, one solution of an equation 
like (17.1) then there is a systematic way to find a second, linearly independent, 
solution. 

The method is called ‘reduction of order’, since it enables us to use our knowl- 
edge of one solution to find a first order differential equation that we can use to 
find the second solution. 

Suppose we know that u(t) solves the second order linear equation 


7 d 
ain a bo +c(t)x =0. (17.2) 


The idea is to make the substitution x(t) = u(t)y(¢) and then solve the resulting 
equation for y(t). From x(t) = u(t) y(f) it follows that 


xX =uy+uy X =uy+2uy+uy, 
and substituting these into the original equation gives 


a(t)(iiy + 2uy + uy) + b(t)(uy + uy) + c(t)uy = 0. 
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The terms in which the factor of y is not differentiated, 
a(t)uy + b(t)uy + c(t)uy = yla(t)u + b(t)u + c(t)], 


add to give zero; the expression in square brackets vanishes since u is a solution. 
So we are left with an equation for y(t), 


a(t)Quy +uy)+ b(t)uy = 0. 


Note that although y and y occur in this equation, y itself does not. So we can 
make a second substitution, putting z = y to obtain a first order equation for z(t), 


[a(t)u(t)]z + Raut) + b@)u@)]z = 0. (17.3) 


Although this equation looks unpleasant, the coefficients of z and z are known 
functions of ft, since a(t) and b(t) are from the original problem and u(t) is the 
solution that we already know. This first order equation can now be solved using 
the method of integrating factors.! 

As with many of the techniques developed here, it is not helpful to try to remem- 
ber the equation (17.3) and produce it from up your sleeve whenever you need to 
use this idea. Instead you should understand the technique and be ready to apply it 
to particular examples when it is needed. 


Example 17.1 One solution of 
¥—2x+A72x =0 


can be found by trying x(t) = e*'. The resulting quadratic equation has a repeated 
root k = i. Use the reduction of order method to show that there is a second lin- 
early independent solution te™. 


To use the reduction of order method we substitute x(t) = y(t)e*’. We then have 
x = Ae y + ey and =X = A7*e“y + 2re" pV +e" 5, 
so that 
Arey + 2xe™ ype y= 2176) 4 one 4-16 y = 0. 
After cancelling all the e*’s this reads 
My + 2aAyt 7 —2d7y — 2ay+a7y =0, 
and further cancellations leave just 


7=0. 


! This possible to find a general integral expression for the solution of this equation, see Exercise 17.7. However, 
it is a much better idea to apply the method afresh in each particular case. 
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Integrating this twice gives y(t) = A+ Bt, and so 
x(t) = Ae’ + Bre™. (17.4) 


We obtain a constant multiple of our original solution (e*”) plus a constant multiple 


of a new, second solution, te’, as claimed. 


Example 17.2 For t > 0 the function u(t) = 1/t is a solution of 
t% — 2x — 4x = 0. 


(This is easy to check: X = —t~? and X = 2t~3.) Find a second linearly indepen- 
dent solution. 


To apply the reduction of order method, we set x = y/t, and so 


bt Dey Le ee Aes 
algae) and foe et oe 


~ 


Substituting these into the equation gives 
Fie tng Mey ad ay ae | my. ae 
t ne Ea? t 
This simplifies to give 
ty —4y =0, 
which setting z = y gives the linear equation 
zZ- -z=0. 
t 


We can solve this using an integrating factor, 


4 -4 
exp = dt} = exp(—4Inrt) =r. 


¢ t~*z(t)) =0 
ar z(t)) = 0. 


So we have 


Integrating this gives t~+z(t) = c, and so 
z(t) =ct*. 


Since z = y, it follows that y(t) = at? +b, giving (since x(t) = y(t)/t) 


b 
x(t) =at* + = 
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second term just repeats the solution we already knew, so the new linearly 


independent solution is f+. 


Exercises 


For further examples of the reduction of order method see also Exercises 18.1 (vi), 18.1 
(vii) and 20.3. 


17.1 


17.2 


17.3 


17.4 


17.5 


17.6 


One solution of the equation 
PH-(P +2)y+(t+2)y =0 


is y(t) = ft. Use the reduction of order method to find a second solution, and hence 
write down the general solution. 
One solution of 


(x — Dy” —xy +y=0 


that is valid for x > 1 is y(x) = e*. Find a second linearly independent solution z(x), 
and check that the Wronskian of y(x) and z(x) is non-zero for x > 1. 
One solution of 


(t cost — sint)x + xtsint — x sint =0 


is x(t) = t. Find a second linearly independent solution. 
One solution of 


GSP i +4.0= Pepe S0 
is x(t) =e‘. Find a second linearly independent solution. 
One solution of 

yxy +y=0 


is y =x. Find a second linearly independent solution in the form of an integral. 
Expanding the integrand in powers of x using the power series form for e*, 


n=0 


and assuming that the resulting expression can be integrated term-by-term show that 
this second solution can be written as 


ee) 2n 
y(x) =A 1 + » i al =| 


(cf. Exercise 20.2(1)). 
One solution of 
fang 5 Ganda cote 
an t— — 3— an co a 
de di : 


is x(t) = sint. Find a second linearly independent solution. 


17.7 


17.8 


17.9 
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(T) If we know one solution u(t) of the equation 
a oF oe + q(t) 0 (E17.1) 
———. — a— 7 
Ge > ag 

then the reduction of order method with x(t) = u(t) y(t) leads to the first order linear 


equation 
u(t)z + [2u(t) + p(t)u(t)]z = 0. 
for z = y (cf. (17.3)). Show that 
Ae J pi)dt 
u(t)? 
and hence find the second linearly independent solution in the form of an integral. 


(T) Suppose that the two solutions of a second order linear differential equation 
(E17.1) are u(t) and v(t). Use the result of the previous exercise, to show that 


d 22) _ Aew Sf PO de 


z(t) = 


’ 


dt | u(t) u(t) 
and hence that 
uv — vu 
pt)=-— 2 
uv — vu 


Find the function g(t) such that u(t) is a solution of 


ax uv—vidx 
dt? uv — vie dt 


+q(t)x =0 


(rearrange the equation for q(t), and substitute x(t) = u(t)) and hence show that the 
second order linear differential equation with solutions u(t) and v(t) can be written 
as 


wy AX matt Gis 
(uv — vi) FF + (av —uv)x = 0. 


This produced Exercises 17.1—17.6 above. 

Using the result of the previous exercise, find a second order linear differential equa- 
tion whose solutions are e’ and cos t. Check that both of these two functions satisfy 
the resulting equation. 
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*The variation of constants formula 


In the last chapter we saw that knowing one solution x) (¢) of a homogeneous linear 
second order equation 


d2x 
ain ar bins +c(t)x =0 (18.1) 


enables us to find a second nee independent solution x2(t). In this chapter we 
see that if we know two linearly independent solutions of (18.1) then there is a 
systematic way to finda eee integral for the inhomogeneous problem 


aye et bn = + c(t)x = f(t). (18.2) 


For simplicity we assume that a(t) : 0, divide equation (18.2) by a(t), and 
rewrite it as 
d2x 
di? 
Now suppose that we know two hare independent solutions x(t) and x2(t) 
of the homogeneous linear problem 


+ pine “+ q(x = = g(t). 


X + p(t)x + q(t)x =0, 
which means that its general solution is of the form 
X(t) = Axi (t) + Bxo(t). (18.3) 
We will look for a particular integral for the inhomogeneous problem 
X + p(t)x +.q(t)x = g(t) (18.4) 
in the form 


X(t) = uy(t)x1(¢) + u(t) x2(t). (18.5) 
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We have replaced the constants in (18.3) by functions of ft; this is known as the 
method of ‘variation of constants’. We will obtain a particular solution with wu 
and u2 given in terms of integrals; in particular examples we may not be able to 
compute these integrals explicitly. 

Soon we will substitute this form for x(t) into equation (18.4), but first notice 
that this will only provide one equation that has to be satisfied by u(t) and u(t). 
Since we have two unknown functions we need two equations to determine them 
completely; we are therefore free to impose an additional condition of our choice, 
and we make a choice below that simplifies our calculations. 

The first thing to do is to compute the derivative of x(t), 


X(t) = ui (t)x1(t) + u(t) x1 (t) + u2(t)x2(0) + u2(t)x2(0). (18.6) 


When we differentiate again we will get second derivatives of the us and the xs; to 
get rid of the second derivatives of the functions u; (which are unknown) we use 
our “extra equation’, imposing the condition that 


Ui (t)x1(t) + u2(t)x2(t) = 0. (18.7) 
If this holds then 
X(t) = uy (t)x1(t) + u2(t)x2(0), 
and so 
X = Ux, + uy x1 + u2x2 + u2X2, 


with no second derivatives of the unknown functions uv; and up. 
Substituting these derivatives into equation (18.4) gives 


UX] + Uj + UX. + u2kX2 + p(t)[uix1 + u2x2] + q(t)[uix1 + u2x2] = g(t). 
This looks unpleasant, but if we group the terms correctly, 
uy[X1 + p(t)x1 + q(t)x1] + u2lx2 + p(t)x2 + q(t)x2] + uix1 + u2x2 = g(t), 


we can use the fact that x; and x2 both solve the homogeneous equation to set both 
terms in the square brackets to zero, and end up with 


WX + u2xX2 = g(t). 
We now have two equations for x; (t) and x2(t), 


rae + u2x2 = 0 
UjX1 + U2Xx2 = g(t). 
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We can solve these equations for 7; (t) and u(t) to give 


X2(t)g(t) 
X1(t)x2(t) — x2(t)x1 (0) 


uj(t) = — 


and 


x1(t)g(t) 


ha(t) = x1 (t)x2(t) — x2(t)x1 (1) 


As a shorthand we will write 
W(t) = x1 (t)x2(t) — x2(t)x1 (0) 


for the denominator in these equations. ! Now to find w(t) and u2(t) we integrate, 
and therefore obtain 


x(t) = —xy f OE dees ae | t. (18.8) 


We will now do two examples, one for which we already know the particular 
integral, and one for which we do not. 


Example 18.1 Use the formula (18.8) to find a particular integral for the equation 
¥+x%-—6x =5e 
(this was Example 14.8). 


Note that generally it is not a good idea to try to remember the formula (18.8). 
It is much better to apply the method itself, which will lead naturally to the same 
expression, and we will do this in the next example. 

Two linearly independent solutions of the homogeneous equation are x1(t) = 
e >! and x2(t) = e7! (we found these in Chapter 12), for which 


W(t) = x1 (t)X2(t) — x2(t) x1 (t) = 2e-%e* — —3e%e7*! = Se". 


(Note that W(t) is never zero.) The formula (18.8) gives 


See —3t 5e tet 
th= dt of =a, 
a) -< f 5e ve 5e—¢ 
=—e / 1 dt + 5e~! : e-! dt 
= —te >! = et 


! Th fact this is the Wronskian of x 1(t) and x2(t), as defined in Section 11.4; we saw there that if x; and x2 are 
linearly independent then their Wronskian is never zero, and so our expressions for #1 and #2 make sense. 
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We have once again found the particular integral —te~*’. The second term here is 
just a multiple of one of the solutions of the homogeneous equation and so can be 
absorbed into the complementary function; for the general solution 


x@) = Ae 4 Bet = fe Fe SA oe 4 Be" te 


and A — 1 is just another arbitrary constant. 


Example 18.2 Find a particular integral for the equation 
xX+x =tant. (18.9) 


For this example we will follow the method outlined in general above, rather 
than just plugging functions into the resulting formula. 
Two linearly independent solutions of the homogeneous equation 


X+x=0 
are x1(t) = sint and x2(t) = cost, so for a particular integral we try 
x(t) = u(t) sint + v(t) cost. 
The first derivative of x(t) is given by 
x =usint+ucost+uvucost — vsint, 


and here we impose an additional condition to make sure that there are no second 
derivatives of u or v in x, 


usint + vucost = 0. (18.10) 
This means that x is given by 

x =ucost — vsint, 
and we can differentiate to find 

xX =ucost —usint — vsint — vcost. 
Substituting for x and x in (18.9) gives (after some cancellation) 
ucost — vsint = tant. (18.11) 

Equations (18.10) and (18.11) are a pair of simultaneous equations for uw and v, 


aeons =0 


ucost — vsint = tant, 
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with solution 


: : sin? t 1 
u = sint and v= = cost — —— 
cos t cos t 
Integrating these two gives” 
u = —cost v = sint — In|sect + tant, 
and so a particular integral is 
x(t) = —cost sint + sint cost — In| sect + tant| cost 


= —In|sect + tant|cost. 


You can check this by substitution (this is less work than it looks if you remember 
that In | sec t + tant| is the integral of sec r). 


Exercises 


18.1 Use the method of variation of constants to find a particular integral for the following 
equations: 
(i) y” — y’ — 6y = e* (you could use the method of undetermined coefficients for 
this example, which would be much more sensible); 
(ii) X-—x =1t7! (you can leave the answer as an integral); 
(iii) y” + 4y = cot 2x. Hint: f cosec xdx = In |cosec x — cot x]; 
(iv) 17% — 2x = f3 (to find the solutions of the homogeneous equation try x = f*, 
see next chapter); 
(v) x — 4x = tant (leave your answer as an integral); 
(vi) 
2 dy 3 dy 4 2 aD) 
(tan* x — 1) —> — 4tan? x — + 2ysec’ x = (tan’ x — 1)(1 — 2sin* x), 
dx? dx 


one solution of the homogeneous equation is y(x) = sec” x, and the reduction 


of order method, which is somewhat painful, can be used (if you wish) to show 


2 The integral of sect = 1/cost is the ungainly In| sec ft + tant]. You can check this by differentiating, 


d 1 sint cos? t + sin? ¢ 
In(sect + tant) = x t 
dt sect + tant cos? t cos? t 
1 1 +sint 


= x 
(1+ sint)/cost cos? t 
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that a second linearly independent solution is tan.x. You should be able to find a 
particular integral explicitly for this example; 
(vii) 


(1 + sin? 1)¥ — (2tant + sint cost)x + (1 — 2tan*n)x = f(N), 


one solution of the homogeneous equation is tant, and again the reduction of 
order method will provide a second solution, cos f, after some effort. You should 
leave your final answer as an integral. 
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*Cauchy—Euler equations 


In this chapter we look at another general class of linear second order equations 
that we can solve in a systematic way. These are the Cauchy—Euler equations, 


gd°y dy 
ax cf aa tox ty =0. (19.1) 


We will see in the next chapter that an understanding of these equations provides 
insight that is useful when we try to find the solutions of more complicated equa- 
tions in the form of power series. 

There are two possible approaches that yield the solution of this problem. One 
method uses the substitution x = e* to reduce the equation to the more familiar 


2y 
are dr2 
which can then be solved by trying y(z) = e*, see Exercise 19.2. 


F+b- ae tey=0, 


However, the method we will use here is similar to the one that we would use to 
solve the constant coefficient equation 


For this equation we try y(x) = e**, because for this guess every term in the equa- 
tion is a multiple of e**. Now, note that (19.1) has a very special form; whenever 
there is a derivative the corresponding term is also multiplied by x, i.e. we have 
x’y" and xy’. The kind of function that when ea and multiplied by x is 
a aniepe of itself! is simply a power of x, y(x) = x* for some k. 
If we try y(x) = x* in (19.1) then, since 


yakx*! and yp" =k(k—1)x*?, 


! Tf this does not seem obvious then the correct form for y(x) can be found by solving the separable equation 
xy’ = ky, see Exercise 8.3. 
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we have 
ak(k — 1)x* + bkx* + cx* =0. 


Cancelling the factor of x* that occurs in each term we obtain the indicial equation, 
a quadratic equation for the index k, 


ak(k —1)+ bk +c =0. (19.2) 


As with the second order constant coefficient case, the types of solution that we 
obtain using this approach depend on whether the roots of the indicial equation 
(19.2) are real and distinct, repeated or complex. 


19.1 Two real roots 


If there are two distinct real roots k, and kz of (19.2) then this implies that y(x) = 
x" and y(x) = x* are both solutions of (19.1), and so the general solution is a 
linear combination of these, 


y(x) = Ax* + Bx, 
Example 19.1 Find the general solution of the equation 
2x*y"” + 3xy’-y =0 
We try y(x) = x*, and so 
2k(k — 1)x* + 3kx* — x* =0, 
which gives the indicial equation for k, 
2k? +k-1=0. 


This can be factorised as (2k — 1)(k + 1) = 0, and sok = 5 or k = —1, and the 
general solution is 


y(x) = Axe Bie 


19.2 A repeated root 


If the indicial equation (19.2) has a repeated real root k then this provides only one 
solution y(x) = x, However, we can use the reduction of order method to find a 
second, linearly independent solution. This will turn out to be y(x) = x* In x, and 
when actually solving an equation like this you should just write down the general 
solution 


y(x) = Ax* + Bx* Inx. 
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Any equation that results in a repeated root can be rewritten as 
xy” + (1 —2a)xy’ + A2y =0. (19.3) 


First we check that this equation really does produce a repeated root for the indicial 
equation; trying y(x) = x* yields (after cancelling the factors of x*) 


KES) AAS Wk ae = 0: 


which is k* — 2Ak + A* = 0, or (k — 4)* = 0. So we only obtain the one solution 
y(x) =x. 
Now we use the reduction of order method (see Chapter 17), trying y(x) = 


x*u(x). This gives 
y =xiul +Ax* ly and y= x? ul” + 2x87 1 WEIS Te 24 u, 


and substituting into (19.3), remembering that all the terms in which u(x) has not 
been differentiated will cancel, we obtain 


x? [xii + 2x? wT + 0 — 20) xh? ’] = 0, 


A+1 


After cancelling a factor of x*"" this gives 


xu’ +u' =0. 


If v = w’ then v' = —v/x; the solution of this equation can be found by separating 
variables, 

dv dx 

ene: 


Integrating both sides gives 
Inv = —Inx+C, 

and so v(x) = A/x. Since v = uw’, this implies that 

du _A 

Gy. 
and so u(x) = Alnx + B, which finally yields 

y(x) = Ax* Inx + Bx’, 

and we can identify the new solution as x* In x. 
Example 19.2 Find the general solution of the equation 


xy" + 3xy’+y =0. 
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If we try y(x) = x* then the indicial equation for k is 
k(k—1)+3k+1=0 


which is k* + 2k + 1 = 0, or (k + 1)? = 0. So k = —1 is a repeated root. It fol- 
lows that the general solution is given by 


y(x) = Ax! + Bx! Inx. 


Note that since (Inx)/x — Oas x — 0, all solutions tend to zero as x — oo. 


19.3 Complex roots 


If the indicial equation has complex roots, k = p + i then the solution is 
y(x) = x?[A cos(wlnx) + B sin(wInx)]. 


Although you should just write down the solution in this case, we now see how it 
can be derived from 


y(x) = CxP He 4. DxP He, 


We can understand x* when k is complex if we use the identity 

xk = gkinx, 
While this is clearly true if x is real (since x = elnx), it can also be used as a 
definition of x* if k is complex. If k = p + iw then we have 


xP tio = xPxio — x Pei lnx 


= x"[cos(wInx) + isin(wInx)], 
i? — cos@ + isin. From this formula it follows that x?~'@ is the complex 
conjugate of x?t', so in order to make our solution real we want 


since e 


y(x) = CxP He 4. CFP Tie 
where now C is complex, C = a + if. We therefore have 


y(x) = 2Re[Cx?t2] 
= 2Re[(a@ + iw)x?[cos(wInx) +isin(wInx)]] 


= 2x?[acos(wIn x) — B sin(wInx)], 


which, choosing A = 2a and B = —28 (they are both arbitrary constants) shows 
that 


y(x) = x?[A cos(wInx) + B sin(wInx)]. 
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Example 19.3 Find the general solution of the equation 


mye —xy’+5y=0. 


We try y(x) = x*, which yields 
k(K—1)—-k+5=0, 
ie. k7 — 2k +5 = 0. The roots of this equation are 


2+VJV4-2 
( ——p_ 0 ey 


So the solution of the equation is 


y(x) = x[A cos(2 Inx) + B sin(2 Inx)]. 


Exercises 


19.1 Find the general solution of the following equations, and also the particular solution 
satisfying the two specified conditions. 
(i) x*y” — 4xy’ + 6y = 0, y(1) = Oand y’(1) = 1; 
(11) ie "+ y=0, yd) = land y'(1) =0; 
(iii) t7% — Stx + 10x = 0; x(1) = 2 and x(1) = 1; 
(iv) ?7?¥ +tx —x =0,x(1) = 4£(1) = 1; 
(v) x22" + 3xz’ + 4z = 0, z(1) = O and z'(1) =5; 
(vi) xy” — xy’ — 3y = 0, y(1) = land y'(1) = -1; 
(vii) 417% + 8rx + 5x = 0, x(1) = 2 and x(1) = 0; 
(viii) x2y” — S5xy’ + 5y =0, y(1) = —2 and y'(1) = 1; 
(ix) 322 2" 4+ 5xz'—z=0,2(1) = 3 and z’/(1) = —1; and 
(x) #74 + 32% + 13x = 0, x(1) = —1 and £(1) = 2. 
19.2 If x = e then 


Show that 


d? d 
ax°— + bx—+cy= (E19.1) 
x x 
yields the linear equation 


2y 


aaa ae (oe aot cy =0. (E19.2) 


Exercises 


By solving (E19.2) find the solution of (E19.1) when the auxiliary equation 
ak’? + (b—a)k +c=0 


has 

(i) two distinct real roots k, and ky; 
(ii) a repeated real root k; and 
(iii) a complex conjugate pair of roots p + 


av 


Qo. 
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*Series solutions of second order linear equations 


We now consider how we might go about finding a solution of the second order 
linear equation 


a ST ee 
dx2 DS as qy = 


in the form of a power series! 


y= eo aye (20.1) 
n=0 


20.1 Power series 


Before we see how power series can be used to find solutions of differential equa- 
tions, we briefly recall, without proof, some of their basic properties. 

Whenever we consider infinite series, the issue of convergence becomes impor- 
tant. A power series is said to converge at a point x if the finite sums 


N 
) Anx” 
n=0 


a 


Throughout this chapter we only consider power series solutions in this form. Although it is possible to consider 
more general series solutions like 


y(x) = Do an(x — x0)" 
=0 


(an expansion ‘about x = xg’) it is always possible to convert such series to something in the form of (20.1) by 
making the substitution x = x — xg in the differential equation. 
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Series converges absolutely 


Ss 


Series diverges 0 Series diverges 


Radius of convergence p 


Fig. 20.1. Within the radius of convergence the power series converges abso- 
lutely; outside the series diverges, and on the boundary it may converge or diverge. 


tend to a limit as N — om, and it is said to converge absolutely if 


N 
So lanx”| 
n=0 


tends to a limit as N — oo. If a series converges absolutely then it must also 
converge, but the converse is not true. 

For every power series there is a number p > 0, known as the radius of conver- 
gence, such that the series in (20.1) converges absolutely for any x with |x| < p, 
and does not converge for |x| > o. When |x| = pe the series may converge, or may 
diverge, see Figure 20.1. 

Most important for us is the fact that while x lies within the radius of conver- 
gence it is possible to differentiate y(x) by differentiating the power series term- 
by-term so that 


[o,@) 


y/'(x)= > NAnx” 


n=0 


The resulting power series for y’(x) has the same radius of convergence as the 
original power series for y(x). 

Although there is no surefire way to find this radius of convergence, one ex- 
tremely useful method is based on the ratio test. The ratio test guarantees that a 
series converges absolutely provided that (the modulus of ) the ratio of successive 
terms is eventually less than one, 


axe?! 


anx" 


an+1 


lim 


n> CO 


= |x| lim 
n—> Co 


an 
It follows that the radius of convergence is given by 


an 


p= lim 
nC 


’ 


Gn+1 


provided that the limit on the right-hand side exists. 
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We can add and multiply power series together within their radius of conver- 
gence, and we will frequently use the fact that if 


oo oo 
Sons" = Jb 
n=0 n=0 


then a, = b, for every n. In particular, if the right-hand side is zero then a, = 0 
for all n. 


20.2 Ordinary points 


We first suppose that p(x) and q(x) are analytic. Essentially this means that they 
can both be expanded as convergent power series, 


(oe) (oe) 
P(X)= 0px" and ge) = Dan". 
n=0 n=0 


In such a situation, x = 0 is referred to as an ordinary point for the equation. In 
this case we can try a power series solution for y(x) of the form 


= dix” (20.2) 
n=0 


Assuming that we are within the radius of convergence of the power series we 
can differentiate term-by-term to find y’ and y”, then substitute into the equation 
and compare terms involving the same powers of x. 

We will start by finding the solution of a familiar equation in this new way. 


Example 20.1 Find the general solution of the equation 
yO y 


as a power series 


[e,2) 
ye) =). ae" 
n=0 


and hence identify two linearly independent solutions. 
Assuming that x lies within the radius of convergence of the power series, we 
have, differentiating term-by-term, 


CO 


Cc 
y(x) = So nanx”! = So nanx"! 
n=1 


n=0 
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(since the first term in the first sum is zero), and then 


CO CO 
y"(x) = Si n(n —l)anx"? = Sinn = age 
n=1 n=2 
Substituting into the equation y’ = —y we get 
(oe) (oe) 
Sinn =Dax == an 
n=2 n=0 


It is useful to rewrite both sums so that the index of the power of x is the same, 


[o.@) [o.@) 
Yi@t 2) + Danyax” = — Do anx". 
n=0 


n=0 


(The ‘summation variable’ 1 is a dummy variable, just like the variable in an inte- 
gration, so we can change it without affecting the value of the sum.) 

All terms contain every power of x from zero upwards, and equating the coeffi- 
cients of x” gives 


(n + 1)\@ + 2)an42 = —4n, 


or more usefully 


an 


FE GAN AED 


This provides a recurrence relation that tells us ay,+2 if we know dy. 

The way that this recurrence relation works means that if we know ag then we 
know a, for all even n, and if we know a, then we know a, for all odd n. So, 
as we would expect for a second order linear equation, there will be two arbitrary 
constants (ag and a;) in the general solution. 

First we consider the even coefficients: 

ao 1 ao ao 1 ao 0. 


HI Ge EE Bea oe ale ne == Gs 


so in general it looks like we have 


ao 


An = (—1)" ny!" 


(20.3) 


We should really check these general coefficients using induction, although we will 
only do so twice in this chapter, once here, and once in our final (and significantly 
more unpleasant) example. If we assume that (20.3) is correct forn = k, then the 
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recurrence relation implies that 


a2 1 (—1)kag 
(Qk+DQk+2)  (@k+D@k+2) (2b)! 
(— 2 or or 
(2(k + 1))! 


as required. Since (20.3) is correct when n = 0 it is therefore correct for all n. 
For odd values of n 


A2(k+1) = — 


a 1 a a l1 aq a 
a3 = a5 = — x - = ay = —— ‘ 
2x3 4x5 3! 5S! 6x75! 7! 


and in general we have 


a) 


een 
d2n+1 = ( Y Gn +! 


It follows that our full series solution is 


Oo an 
yo) =a [1-4 -+(-1? +] 


2 °° (2n)! 
is x3 de x? se 1y" ent m 
BEG 190 Qn+1! 


We can recognise the power series in the square brackets as those for cos x and 


sin x (see Appendix C), and so 


y(x) = agcosx + a, sinx, 


as we might have expected. These power series are known to converge for every 


x; we can easily check this using the ratio test. The ratio of two successive terms 


in the series for cos x is 


2n+2 2n Zz 
ht ou ~ ~ ae 
(2n +2)! oat (2n +2)(2n +1) 


which, for each fixed x, tends to zero as n — oo. Similarly, for the ratio of two 


successive terms of the series for sin x we have 


ent 2 


Ee yor = am fo" sy =X, 
(2n + 3)! (Qn+1)! (Qn+3)(Qn+1) 


The graph of cos x, along with the result of taking a finite number of terms in 


their power series expansions, is shown in Figure 20.2. 
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Fig. 20.2. A graph of cosx against x (the bold line) along with the sum of a 
finite number of terms of the series solution (n gives the highest power of x in the 
expansion). 
We now use the series solution method on a less familiar example.” 
Example 20.2 Find the general solution of the Airy equation 
I 


y” —xy =0 (20.4) 


in the form 


(oe) 
I= So ae 
n=0 


Assuming that x lies within the radius of convergence of the power series, we 
have, differentiating term-by-term, 


Ce (oe) 
y«)= Snag = So nanx" 
n=0 n=1 
(since the first term in the first sum is zero), and then 
(oe) (oe) 
y"(«)= Sinn —1)a,x"-? = Sinn espa. 
n=1 n=2 


2 Although both the Airy equation and Bessel’s equation (the subject of Section 20.4) may be unfamiliar, their 
solutions are standard functions, just as are sine and cosine. Indeed, Matvas has built in definitions of both the 
Airy functions (airy) and of the Bessel functions (besselj and bessely). 
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Substituting into equation (20.4) we have 


CO 


(oe) 
Sinn tao Si So anx" =0 
n=0 


n=2 
or, taking the factor of x inside the sum in the second term, 


[o,e) 


Co 
Sinn Sax = aa —(), 
n=0 


n=2 


Rewriting both sums so that the index of the power of x is the same we obtain 


Co Cc 
Son $.2)(n + Dangax"” — Yo an—1x" = 0. 
n=0 n=1 


A constant term only occurs in the first sum on the left, and this must be zero to 
match the right-hand side, from which we obtain 2a7 = 0, so az = 0. Otherwise, 
setting the coefficient of x” on the left-hand side to zero (in order to match the 
right-hand side) we have 


(n+ 2)(n + 1)dn42 — Qn—-1 = 9, 


which we can rewrite as 
an 


(a+ 3)@ +2) 


Now the recurrence relation tells us adn+3 if we know ap, so will give a3, a6, do, 


an+3 = 


etc. in terms of ag, and a4, a7, ajo, etc. in terms of a;. Since az = 0, it follows that 
a5 = dg =ayj =::: = 0. 
For the coefficients based on ag we have 


ao a3 ao a6 ao 
=> -_—— a= => ag = oF eee 
2-3 5-6 2-3-5-6 eBags Das~SuG cee 


and so in general 


a3 


Puce Git oak (Bn —1)-3n 


Similarly for the coefficients based on a; we have 


a| a4 a\ ay a\ 
Sd 6p Sedgey oc On10 smo ar son1O. 
and in general 

a| 


mice i ae ere eee 
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So the solution of the equation is 


3 x6 xan 
fel ss st 
y(x) wli+ sat sase+ 1 Dears Gn—1)-3n" 
Bigfal eee e ea ae 5 
a gilli WORT RE WR BAe se < 3n - (3n +1) 


Note that this gives the solution in the form 
y(X) = agA1 (x) + a1 A2(x), 
i.€. aS a Superposition of two (we presume) linearly independent solutions Aj (x) 
and A2(x), each of which is given as a power series: 
3n 


[e,2) 
Xx 
Ai(x) =1 
1(x) + hia cea (n —1)-3n 


and 
ont 


(oe) 
A = , 
2(x) ge eer er 3n- (3n+ 1) 


The ratio of successive terms in Aj (x) is 
73+) xn 


2-3-(3n—1)-3n-(3n+2)-(n4+3)/ 2-3----> (3n — 1)-3n 


= Ix|? 

Bn +2)Bn +3)’ 
so that whatever the value of x this expression tends to zero as n —> ov. It follows 
that the series converges for every value of x. A similar analysis shows that A2(x) 
also converges for every x. 

The graph of A(x) is shown in Figure 20.3, along with the approximations 
given by taking a finite number of terms in the corresponding power series. 


The standard forms of the Airy functions, known as Ai(x) and Bi(x), are linear 
combinations of A;(x) and A2(x); Ai(x) is chosen such that Ai(x) > 0 as x > 
oo and Bi(x) such that the Wronskian W[Ai, Bi](x) = 1/a. 


20.3 Regular singular points 


There are many important equations that can be written in the form 


y" + p(x)y' +. q(x)y =0 (20.5) 
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3 


Fig. 20.3. The graph of A(x) against x shown as a bold line, along with the 
values of some of the finite sums. 


in which p(x) and q(x) are not analytic, i.e. cannot be expressed as a power series. 
The simplest example is the Cauchy—Euler equation 


MW 


x*y" + poxy’ + qoy =0 (20.6) 


from the previous chapter. Dividing through by x” gives 


Poy 4 Py =0, (20.7) 
x x 


ye + 
and so p(x) = po/x and q(x) = qo/x* cannot be expanded as power series. 
However, we saw in the previous chapter that (20.6) has solutions of the form 
y(x) = Ax°, for an arbitrary constant A, where o has to satisfy the indicial equa- 
tion 
o(o —1)+ poo +qo=k. (20.8) 
We can extend the power series method to equations in which p(x) is ‘no worse 
than po/x’ and q(x) is ‘no worse than qo/x~’, which are the ‘bad’ factors occurring 
in (20.7). 
A point x is known as a regular singular point for equation (20.6) if xp(x) and 


x°q(x) are analytic (i.e. have convergent power series expansions). In this case 
p(x) and q(x) can be written as 


PO 
p(x) = + pit pox + pax +++ 


0 1 
qe) = + Santas tqar te. (20.9) 
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To find a power series solution in this case we replace the arbitrary constant in our 
solution y(x) = Ax° of the Cauchy—Euler equation by a power series, and look 
for a solution in the form 


[oe e.e) 
y=" Ya =) ax. (20.10) 
n=0 n=0 


The appropriate values of o will once again be determined by an indicial equation, 
as we now see. 
Provided that we are within the radius of convergence of the series we have 


e.2) 
ya) = Dann + o)xorr! 
n=0 


and 
(oe) 
y"(x) = So an(n we lincoln, 
n=0 


Substituting these into y” + p(x)y’ + q(x)y = 0 and using the expansion of p(x) 
and q(x) in (20.9) gives 


[o.@) 
So an(n Pontos ler 
n=0 


[e,2) 
+ [+ pit pox t px? +--+] oan to)x7*"! 20.11) 
n=0 


[e,2) 
+ (54 Star tase taax? +] Vane = 0. 
x x n=0 


Looking at the coefficient of the lowest power of x, x°~, and setting this to zero, 
we can show that we need oa to satisfy the familiar indicial equation 


o(o —1)+ poo + qo = 0, (20.12) 


cf. (20.8). We might expect that if we have two distinct roots 01 > o2 of (20.12) 
then this would give us two linearly independent power series solutions of our 
differential equation, and this will be the case provided that o; and o2 do not differ 
by an integer. 

To see what the problem is if the roots differ by an integer, we look at the coef- 
ficient of x°+”~* from equation (20.11), which is 


[n+o)n+o-—-1)+ poa+o)+qolan + terms involving ao,...,dn—1 = 0. 
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If we rearrange this to find a recurrence relation for a, we get 


= terms involving do,...,@n—1 
— (nto)n+o—1)+ pon+o)+qo 


an 


The denominator of this equation will be zero if n +o solves the indicial equa- 
tion (20.12). For the larger root, o;, the denominator will never be zero, so we will 
obtain a solution. However, if the roots differ by an integer N then o7 + N = oj, 
and so the recurrence relation for the series involving 0 = o2 will run into a prob- 
lem when n = N (the denominator will be zero) unless there are happy cancella- 
tions which also make the numerator zero for this value of n. 

Thus if there is a repeated real root, or (in general) if the roots differ by an 
integer, we will only be able to find one solution in the form of the power series 
(20.10), 


Ce 
yo =). aa 
n=0 


However, by making our ‘guess’ a little more complicated it is possible to find 
the second solution. Our analysis of the Cauchy—Euler equation in the previ- 
ous chapter provides a clue as to the form of this guess. There we found that 
if we had a repeated root o of the indicial equation then the general solution 
was 


y(x) = Ax? Inx + Bx®. (20.13) 


Once again we replace the arbitrary constants by power series; we replace Ax° by 
our expansion for yo(x), and B by a new power series whose coefficients we have 
to find, 


CO 
yy (x) = yo(x) Inx + os bpxot”. (20.14) 


n=0 or 1 


If there is a repeated root o then the sum on the right-hand side is taken from 1, 
while if the roots differ by an integer o = 02 is the smaller root, and the sum is 
taken from zero.* The algebra involved in substituting (20.14) into the equation is 
usually fairly daunting, and we will only consider one relatively simple example 
in what follows. 


3 Taking the sum from zero in the case of a repeated root will add a multiple of yg(x) to the solution y; (x) and 
complicate the algebra. 
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20.4 Bessel’s equation 


We consider how this method applies to ‘Bessel’s equation of order v’, 


dy 
ares wt + (x7 — v*)y =0. (20.15) 


By choosing various different values of v we will be able to produce examples of 
all the different possibilities outlined above. 
If we divide by x then we obtain an equation in the standard form, 


dy 1 dy v 

See P(e ya, 

dx . x nena 2) 
p(x) q(x) 


from which we can see that x = 0 is a regular singular point, since although neither 
p(x) nor q(x) is analytic, xp(x) and xq (x) are. However, it is more convenient 
to work with the equation in the form (20.15). 

We try a solution 


C 
yay= > aux, 
n=0 


and so within the radius of convergence 


y'(x) = S>an(o ah n)xotn! 


n=0 
and 
[e,2) 
y"(x) = So an(o +nj\(o+n—1)x?t? 2; 
n=0 
Substituting this into (20.15) gives 


[o.@) [o.@) 
So an(o +n)(o +n —1)x°™" + So an(o ae ye 
n=0 n=0 


+m ax? oY aga 


The coefficient of x° gives 


ago (o — 1) + ago — vag = 0, 


2 


which yields the indicial equation o? = v~. The coefficient of x°*+! gives 


ai(o + l)o +a;(o + 1) — va; = 0 (20.16) 
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and so a, = 0 whatever the value of v. For n > 2 the coefficient of x°*” is 
an(o tn)(o +n—1) +an(o +n) + an_2 — v7an =O, 


yielding the recurrence relation 


Fi act a 
: (n+o)2 — v2" 
Since o = +v this gives 
an—-2 
= 20.17 
oe n(n + 20) ( ) 


Example 20.3 Find two series solutions of Bessel’s equation when v = 7 


The indicial equation has roots 0 = +3, and these differ by 2/3, which is not 
an integer. So we should be able to find two power series solutions, 


of 1 oO. 1 
aes and a ae 
n=0 n=0 


For the first series 0 = ; and the recurrence relation in (20.17) becomes 


aAn—2 = 9an—2 
n(n+2/3) = 3n(3n +2) 


an = 


Since a; = O the recurrence relation shows that a, = O for all odd values of n, and 
we have 


9ao 9ar 97.49 a9 
== == == = 
Be 68 oO al. eet -° = ~esenOr iba aai4 
and in general 
9" ao 5\7) ag 
a2, = (—1)" —————————_- = (-1)" [ = ————S 
6-8+--- 6n - (6n + 2) 2 3-4-...- 3n- (3n + 1) 
So we have one solution 
— aia" 
. 2 ely = ni f2N™ 
jie) =x d¢ ay eC 
For the second solution with o = —} the recurrence relation is 
bn—2 9bn-2 


aa 2/3) 3n@n —2)’ 


and so 
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Fig. 20.4. Graphs of j1/3(x) (top) and j_1/3(x) (bottom) against x along with 
their approximations from series with 4, 6, 8 and 10 terms. 


bp = 200 yp, 902 oe 9°bo 
oe PTO AG AOSD. > ALG TAO o 16 nds: 


9"b 3\" b 
bon = (—1)" = cay" ( ) 2 
re eae (6n — 2)-6n Dy) OD aaa (n —1)-3n 


giving a second solution 


. pe (3x/2)" 
— 1/3 —])" 
j-13%) =x 2 I FR Ga eon 


Graphs of j/3(x) and j_1/3(x) are shown in Figure 20.4. 


It is interesting to note that these functions are related to the solutions of the 
Airy equation. For example, we had 


3 6 


7) ed ee ee zi re 
x)= ote aan 
' Deo Dees De ree (Bn —1)-3n 
aoa aye § 3/24 53/220 
cae eae (Gix?*) ee eye (baes tes 
ee a ee Gn — 1) -3n 


2.3 
=a “. 3/2 ; 
J-1/3 & ) 


Just as there are standard choices for the two linearly independent solu- 
tions of the Airy equation, there are standard normalisations for the series 
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solutions of Bessel’s equation. The Bessel function J,,(x) (returned by MATLAB’s 
besselj(nu,x)) is equal to 2"'(1 + v) times the series solutions j,(x) that 
we obtain in this chapter, where I(x) is the gamma function (a generalisation 
of the factorial function defined by the integral '(z) = te t?—le~! dt, see Exer- 
cise 20.8). 


Example 20.4 Find two series solutions of Bessel’s equation when v = 5: 


In this case the indicial equation has roots 0 = +5, and these differ by an inte- 
ger. Foro = ! the recurrence relation in (20.17) gives 


an—2 
a, = ——__.. 
n(n + 1) 
As before all odd coefficients are zero, 
ao ao ao 
a) ee = pee 
aT) ase. ada: See a 
and the general coefficient is 
ao 
= (—1)” —_—_—_.. 
an = (I) Gap! 


It follows that 


lee) ss xn 4 f 
y(x) = ao par aah 


n=0 
_ 0. px ween 
aie = (2n + 1)! 
sin x 


ae 


and one solution of the equation is j1/2(x) = sinx/,/Xx. 

We might suspect that the other solution is cos x /,/x and this in fact turns out to 
be the case. This is an example in which there are “happy cancellations’ that allow 
us to find a second solution using the series method even though o; and o2 differ 


by an integer. For 0 = —5 the recurrence relation in (20.17) becomes 
an—-2 
= Se 20.18 
an AGE 1) ( ) 


and the ‘happy cancellation’ arises since a, is always zero (this followed from 
(20.16)) and so we never need to use (20.18) with n = 1 (which is where the 
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Fig. 20.5. The two solutions j1/2(x) = sinx/./x (solid), and j_1/2(x) = 
cos x/./x (dashed), plotted against x. 


problem would arise). For this second solution we get 
ag ao ao 
ay = SS a4 = a! = 


and in general a2, = (—1)"/(2n)!; we have 


0 


As we suspected, there is a second solution j_1/2(x) = cos x/,/Xx. 
These solutions are illustrated in Figure 20.5. 


Example 20.5 Find a series solution of Bessel’s equation when v = 0, 


ary” +xy'+ x7y = 0. (20.19) 

In this case the roots of the indicial equation are both o = 0, so we can only 

find one solution in the form of a simple power series. The recurrence relation in 
(20.17) becomes 
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Fig. 20.6. The graph of Jo(x), plotted against x, along with the sum of the series 
with 4, 6, 8, 10 and 12 terms. The series with 14 terms cannot be distinguished 
from Jo(x) over this range of x values. 


Since a; = 0 all odd coefficients are zero, and 


ag ag ag 
sa a a a OT 
It follows that 
(—1)"ao (—1)"ao 
a2n = 


~ 2242... (2n)2 22 (nt)2” 
Thus we have 
Cc 
(=1)* «5 
yo(x) = ao bp Pr (nh)? ~ ma 
n=0 


with the convention that 0! = 1. The quantity in the square brackets is Jo(x), the 
Bessel function of the first kind of order zero. A graph of Jo(x), along with its 
approximations by taking a finite number of terms from the power series, is shown 
in Figure 20.6. 


We end this chapter with a somewhat more painful calculation of the second 
solution for Bessel’s equation of order zero. 


Example 20.6 Find a solution for Bessel’s equation of order zero in the form 


y(x) = Jo(x) Inx + ee 


n=1 
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First we calculate 


/ / Jo(x) = n—1 
y (x) = Jo(x) nx + 7 + So nbnx 
n=1 


and 


JIN(x J 2 
y"(x) = Jj (x) Inx + 20) _ fo) + ) n(n — 1)byx"~?. 
x x2 = 


When we substitute these expressions into Bessel’s equation of order zero, 
x?y" + xy’! +x7y = 0, all the terms involving Jo(x) cancel except one, and we 
end up with 


(oe) (oe) CO 
2x Jg(x) + Dnt — Wbpx” +) bax” + D> byx”*? = 0. 


n=2 n=1 n=1 


Since 


oO _qyr 
Jo(x) = Ys a 


Xx 
2) 2 
Ls 2n(n!) 


it follows that 


CO 
(—1)"2n 4 
/ = n 
xJo(x) = yi Dnt” P 
n=1 
Therefore we have 
Cc Cc —] ng 
byx + Abox? +S In2by + bya)” = —2 99 LP 20, 
al 


=> xX 
2 Z 
& 227 (n!) 


Since only even powers occur on the right-hand side it follows that b; = 0, and 
that for n odd 

nbn + bn—2 = 0, 
which implies that b, = 0 for all odd indices n. When n is even we have bz = 1/4 
and for n > 2 the coefficients of x~” give 
; Cnn 
(2n)“bon + ban-1) = ~ 72D nh? 


which is 


1 aril: 
boy = 52,2 —b2(n-1) = 22-1 (n!)2 
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eae! ne: eae Oy Ae 
see ee ae ly ae oa 


then 


ee Seip Vf oat | fe 
6 92.32 | 2242 2) ° 2482] 224262 ar 


It looks as though we have 


Goce aE fiona 
on 92n(nb)2 2 n}’ 


although this is one solution that we should check by induction. Assuming that this 
is correct for b2;, we have 


Bye GD gl TV Dee) 
bo(k+1) = 22(k + 1/2 |- 22k (k!)2 (1 lr 5) i 2) = Vk ((k + 1)!)? | 


2.60 em (1 ene ) i 
~ 22(k +1)2 | 22k(k!)2 2° kd) PREDAK 4D 
: (—1)*+2 ; 1 1 1 

~ D297K(K + D2(kI I te z) ay ae | 


eet eae 
— Q2K+D ((k 4 1)1)2 2 k+1/)’ 


as required. Writing H,, for the sum of the first n terms of the harmonic series 


we therefore have the second solution 


An 2n 
2 (nly2” 


yo(x) = Jo(x)Inx + )*(-1)"" (20.20) 
n=1 


This second solution, which blows up as x — 0 like In x, is shown in Figure 20.7. 

The standard normalisation of this second solution means that the “Bessel func- 
tion of the second kind of order zero’, Yo(x), returned by the Mat Las function 
bessely(0,x), is somewhat surprisingly given by 


2 
Yo(x) = _ bo) + (y —In2)Jo(x)], 
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Fig. 20.7. A second linearly independent solution for Bessel’s equation of order 
zero plotted against x, and its approximations by Jo(x) Inx + S,, where S,, is the 
series part of (20.20) with 2, 4 and 6 terms. 


where y is the Euler—Mascheroni constant defined by 


20.1 


y = lim (A, — Inn) © 0.5772. 
n—->cCo 


Exercises 


You may find the following two identities useful for these exercises: 


2-4-6+---- 2n = 2"n! 
and 
_ Qn)! 
1-3-5----- (2n — 1) al 
Legendre’s equation is 
(1 —x7)y” — 2xy’ +10 + Dy = 0. (E20.1) 


If y(x) is given by a power series, 


CO 


yax)= yas" 


n=0 


find the recurrence relation satisfied by the coefficients a,. Show that if / is a positive 
integer then there is a solution given by a power series that has only a finite number 
of terms, i.e. a polynomial. For each value / = 1, 2,3, and 4 find the polynomial 


solution that has y(1) = | (these are the ‘Legendre polynomials’ P;(x)). 
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20.3 


20.4 


20.5 


20.6 
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Find two independent power series solutions of the following equations, and use the 
ratio test to find their radius of convergence. 
(i) y’ —xy’ + y = 0 (cf. Exercise 17.5), 

Gi) 1+x7)y’+y =0, 

(iti) 2xy” + y’ — 2y = 0 (you should be able to sum the two power series to obtain 
explicit forms for the two solutions), 

(iv) y” — 2xy’ + 2ky = 0. By finding the recurrence relation for the coefficients in 
the power series identify those values of k for which one solution is a polyno- 
mial. Find both solutions when k = —2 and k = 2; in each case you should be 
able to find a simple expression for one of the two solutions, while the other 
can be written as a power series whose general term you should be able to find 
explicitly. 

Find one power series solution of the equation 


x(1—x)y"” —3xy’-y=0. 


You should be able to sum this power series to write down the solution explicitly. 
Now use the reduction of order method to find a second solution. 
Find one series solution of the ‘modified Bessel equation’ 


x2 y" fe xy! ane *y= =: 
Find a series solution for Bessel’s equation of order one, 


xy” + xy’ + (x? - Dy =0. (E20.2) 


You should obtain 


(— 1)"x 2n 
yx) = ee) aia ere 7 Din: 


with the choice c = 1/2 this gives the standard form of the Bessel function J; (x), 


22 (-1)" x\2nt+l 
A@ = ee pini (a) 


In order to find a second solution of (E20.2), substitute 


y(x) = Ji(x) nx + — |S x a 


where Jj (x) is the series solution from the previous question, to show that 


By oo k 
(-1)* (2k + 1) (x\ 2k! 
bi +b * = bagi + by—1]x" = —2 ) 
1+ oe Dele busi + bai \x 2d (k+1)!k! c 


Hence show that by) = —1, b; = 0, and that b, obeys the recurrence relation 


(n? — 1)bn41 + Da-1 = 0 


20.7 


20.8 


20.9 
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if n is even and, fork = 1, 2,3,..., 


(—1)* (2k + 1) 


2k +1)* —1]b i Soe 
[(2k + 1) ]bocK41) + bx E(k + 1k! 


(E20.3) 


Deduce that b; = 0 for all odd values of j. 
Denoting by H,, the sum 


verify that 


_ (1A + Ae-1) 
ah DREN = 1) 


solves (E20.3) and hence write down a second solution of (E20.2). 
Show that when n is a positive integer one solution of Bessel’s equation 


dy dy 2 
x 
dx? dx 


can be written as the power series 


ILO joes ‘ie (E20.4) 
j=0 


n’)y =0 


j!n+ fj)! \2 


(T) The gamma function generalises the factorial function to values that are not 
integers. For any real number z we define 


[o,@) 
T(z) = / i let de. 
0 
Integrate by parts in order to show that for a positive integer n 
Tian+1)=nI(n). 


Since (1) = 1, deduce that ['(m + 1) = n!. (Using the gamma function in place of 
one of the factorials in the power series (E20.4) gives 


oe 1 X\vt2j 
I(x) = di IY FO aye 1) G) ‘ 


and this formula now applies for any real number v. This is where the strange nor- 
malisation of J, for non-integer v comes from, see comments after Example 20.3.) 
(C) Write a short program to generate the coefficients in the power series expansion 
of J,(x) for any value of v using the recurrence relation (20.17). Investigate how 
many terms of the expansion you need to take in order to approximate the solution 
well on a fixed interval (0 < x < 10, say). (You might like to look at the M-file 
besselseries.m, which produced the Bessel function figures in this chapter.) 
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20.10 (T) The Bessel functions might seem exotic, but they arise very naturally in prob- 
lems that have radial symmetry. For example, the vibrations of a circular drum sat- 
isfy 


au «1:8 (Ou 1 07u 
= ; E20.5 
ar2,—sr Or (: =) rt r2 002 ( ) 


where u(r, 0, t) is the displacement of the circular skin of the drum at a point ex- 
pressed in polar coordinates. In the method of separation of variables we look for a 
solution of the form 
u(r, 9, t) = RY)O@)T(), 
and try this guess in the equation. Substitute this in to (E20.5) and show that 
Lee... cd ( <) 1 &o 


Td2@ rRdr\ dr) 726 de2- 


(E20.6) 


The left-hand side of this equation is a function of ¢ alone, and the right-hand side 
a function of r and 9, so in order to be always equal they must both be constants. 
Choosing 

127 

Td? 
(there are good physical reasons for choosing this constant to be negative) show that 
we can rearrange (E20.6) to give 


ee ates (E20.7) 

= r rk’. : 
© dé? Rdr dr 

Now the left-hand side is a function of 6 alone, while the right-hand side is a func- 

tion of r alone; so both sides must be equal to a constant. Now we choose 


1”?o_ 4 
@ de> 
(again there are good physical reasons why this constant should be positive); show 
that in this case (E20.7) can be rearranged to give 
@?R dR 
2 212 2) p 
r qe tte te k*—v \)R=0. 
Finally substitute x = rk to show that R satisfies Bessel’s equation of order v, 
@?R dR 
2 2 
Ge ax 


v)R=0. 


Part IV 


Numerical methods and difference equations 
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Euler’s method 


If we have a differential equation that we cannot solve analytically then we can 
always try to treat the problem numerically. In this chapter we consider one nu- 
merical way to approximate the equation 


dx 
ae = f(x,t) x(0) = xo. (21.1) 

The first step is to give up the idea of finding a solution for all values of the 
independent variable (x(t) for any ¢ € R) and instead try to find an approximation 
to the solution at a discrete set of values of t. This is illustrated in Figure 21.1. 

In the simplest case we try to approximate the solution at equally spaced values 
of the independent variable. This means, for example, that we want to approximate 
x(h), x(2h), x(3h), x (4h), etc. The difference between two successive times, here 
h, is called the timestep (or more generally, when the independent variable is not 
necessarily time, the step size). 

If we have a method for approximating x(t + hf) given x(t), then we can apply 
the method repeatedly to find approximations for x (nh) for any n, if x (0) is speci- 
fied initially; first we approximate x (i) = x(0 + h), then using our approximation 
for x(h) we approximate x(2h) = x(h +h), then x(3h) = x(2h +A), etc. 

In this chapter we will introduce the simplest numerical method for producing 
approximations to the solutions of (21.1). 


21.1 Euler’s method 
The differential equation 
dx 
ap = f(x,t) (21.2) 
tells us that at time t = s, the rate of change of x(t) is f (x(s), 5). If the timestep h 
is small enough for us to assume that this derivative changes little between t = s 
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Fig. 21.1. The curve shows a notional ‘true solution’, and the crosses show the 
result of numerically approximating the solution at a discrete set of values of t. 


X(s + h) 


s s+h 


Fig. 21.2. Euler’s method. The curve represents the exact solution x(t), the bold 
line has slope f (x(s), s), and the cross gives the Euler approximation to x(s + h) 
given x(s). 


and t = s +h, then we can pretend that x(t) is actually constant over this interval, 
and so we can make the approximation 


x(s +h) + x(s) +h f(x(s),s). (21.3) 


The effect of doing this is shown in Figure 21.2. 
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The expression in (21.3) is also what you would get from keeping only the first 
two terms in the Taylor expansion of x near time tf = s, 


x(s +h) ¥ x(s) +hx(s) = x(s) +hf (x(s), 5), 


since x(s) = f(x(s), 5) (see Appendix C). 

In order to have a more compact notation, we can write t, = nh and x, = x(t). 
The approximation in (21.3) then gives rise to Euler’s method of numerical solu- 
tion, 


Xn+1 =AXn +h f (Xn, tn) with xo = x(0). (21.4) 


This is a simple example of a difference equation, where the continuous vari- 
able (t) has been replaced by a discrete index (n). We will treat difference equa- 
tions more systematically in the next chapter, but for now we will consider the 
application of Euler’s method to some simple examples. 


21.2 An example 


Because Euler’s method is so simple, it is possible to apply it “by hand’. Suppose 
that we want to approximate the solution of 
dx 5) 


—=t-x 


ze x(0) =0, (21.5) 


at time t = 2. This is an equation whose solution cannot be found explicitly, hence 
the need for a numerical method. An accurate numerical value of x(2) is x(2) = 
1.1936 (correct to 4 decimal places), and we will try to reproduce this with Euler’s 
method. 

Using the method with a timestep h = 1 we have t, =n, xX, = x(n), and 


Xntl = Xn thf (Xn, tn) 
= A(tn — ie) 
= Xn + (n = on 

So we have 
x4 = xo + (0— x9) =0 
x2 =X] + (1 — x7) = 1, 
which gives the value x(2) ~ x2 = 1. That this approximation is not very accu- 


rate is unsurprising, since we have assumed that the derivative of x(t) is constant 
between t = 0 andt = 1, and between t = 1 andt = 2. 
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However, we can do much better if we use the timestep h = 1/2. In this case we 
have t, = n/2, Xx, = x(n/2), and 


Xn+1 = Xn + hf (xn, tn) 


So we have 
x1 = x9 + 5(0— 2x9) =0 
saat Hd — af) =04 45-07) =] 
san ti(- x)= $438) = 8 
2 
uan+iQ-x)=9+10-@)) 
33 + 3(5 — usa) = 35 + 20m 
834 
689 


The final stage here, x4 = 834/689 ~ 1.2104, is our approximation to x(2), and 
significantly more accurate than the result for h = 1. 

Although this is already slightly beyond the limit of what is comfortable to 
calculate by hand, a computer can happily apply the method with much smaller 
timesteps. As the timestep is made smaller, the distance between successive times 
decreases. As this happens the assumption that x is constant between times ¢, and 
tr41 becomes more and more accurate, and so the approximation to x(t) becomes 
better and better. Figure 21.3 shows the approximations of the true solution of 
(21.5) for various values of h over the time interval 0 < t < 2. 

It appears that the Euler method has served us extremely well, even with the 
relatively large timestep ) = 1/2. However, if we try to use this method to ap- 
proximate the solution on a longer time interval, say 0 < ¢ < 11, then Figure 21.4 
shows that we can start to run into problems. The Euler method continues to ap- 
proximate the solution extremely well until around t = 8, but then the errors begin 
to accumulate and the numerical solutions starts to oscillate wildly, while the true 
solution continues to increase relatively gently. 


21.3 *MATLAB implementation of Euler’s method 


Euler’s method is easy to implement computationally. The MATLAB code re- 
quired to apply the method to equation (21.5) is given below (and is available as 
euler.m on the web). Also included are two commands to plot the numerical 
results graphically, as in Figures 21.3 and 21.4. 


21.3 *MATLAB implementation of Euler’s method 


Fig. 21.3. The solution of equation (21.5) as calculated by Euler’s method, for 
h=1,h=1/2,h= 1/4, h = 1/8; the numerical values are marked by crosses. 
Also shown is the ‘exact solution’ in bold. (This ‘exact solution’ is in fact the 
result of Euler’s method with h = 27!°.) 


0 2 4 6 8 10 2 
t 
Fig. 21.4. The solution of equation (21.5) as calculated by Euler’s method with 


h = 1/2. As before the numerical values are marked by crosses, and the exact 
solution is shown in bold. 
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%% Euler’s method 


T=2; $% final time 


h=0.5; $% timestep 


MATLAB does not allow an index 0 


on a vector, so x_n is x(n+1) here 


initial time 


initial condition 


t(n+1)=n*h; 

x(n+1)=x(n) + h * (t(n)-x(n)%2); 
end 
[er se] %% display values 


%% Plot crosses at numerical values, and join these 


plot(t,x,’x’, ’MarkerSize’ ,20) 
hold on 
plot (t,x) 


This program outputs the following values: 


t = 0 0.5000 1.0000 1.5000 2.0000 


xX = 0 0 0.2500 0.7188 1.2104 


and produces the graph of Figure 21.5. 


21.4 Convergence of Euler’s method 


We can investigate how Euler’s method behaves when we make the timestep 
smaller if we apply the method to an equation whose solution we already know. 
In this case we can more easily compare our approximate numerical solution with 
the exact solution. 


21.4 Convergence of Euler’s method 207 


Fig. 21.5. The graph produced by the MATLAB code of Section 21.3. 


We will apply the method to the simple linear equation 


dx : 
—=xX with x(0) = 1, 
dt 
for which we know the exact solution x(t) = e’. 
If we use a timestep / then t, = nh, x, = x(nh), and Euler’s method gives 
Xnt1 = Xn thx, = A +A)Xn with xo = 1. 
It is easy to find the solution of this equation, since 
xp =U +h)xo = +h) 
xy =(1+h)xy = (1 +A) +h) = (1 +h)" 
x3 = (1+h)x = (+A) +h =A +h), 
and so in general! Xn = (1+h)". 


Since x, is an approximation to x(n/h), the approximation we obtain for x(t) is 
Xt/n (where we assume that t/h is an integer), 


x()~dtany/", 


Since t = nh we can replace h by t/n, and so 


t n 
x(t) = (1 + “) 5 
n 


! This can be checked using induction. The induction hypothesis is that x, = (1 +h)”. Assuming that this is 
true for n = m, it follows that x4; =(+A)xm =(1+A)14+h)" =(4 hynt], and so the hypothesis 
holds for n = m + 1. Since x9 = (1 + hy? = 1, induction shows that this is the correct solution. 
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Fig. 21.6. The bold line is the exact solution x(t) = e’, while the other lines show 
the results of the Euler method with successively smaller values of h, from h = 1 
to h = 1/32, with crosses marking the numerical values. 


If we keep x fixed and let h — 0 then since n = t/h we need to letn > 00. A 
standard result from analysis (see Exercise 21.4) guarantees that 


Go aa 
1+-] -e (21.6) 
n 


as n —> oo, and so as the timestep is refined our numerical approximation does 
tend to the exact solution. This is illustrated in Figure 21.6, which shows the exact 
solution and the solution obtained using Euler’s method with various values of h. 

Although we have shown that the Euler method works for this particular exam- 
ple, if it is to be a truly reliable method then we should have a proof that whatever 
the equation, if the timestep is small enough then the numerical solution will be a 
good approximation. The mathematical discipline of numerical analysis deals with 
such problems. For example, suppose that f(x) is a function that satisfies” 


If@) —fO)| s Llx - yI 
for some constant L, and x(t) is the exact solution of 


dx 0) = 
qt x(0) = yo. 


2 This condition, that f be a Lipschitz function, is in fact what is required to ensure that the equation x = f(x) 
has unique solutions, see equation (6.5). 


Exercises 209 
Now, if x, is the solution of the Euler method 
Xn+1 = Xn thf (Xn) with x9 = yo 
then x, should approximate x (t,,) (where ¢, = nh). Writing x(f,) for x,, given any 


T > Oitis possible to prove the error estimate 


max |X(ty) — x(t)| < Kh, (21.7) 


O<t, <T 


for some constant K (depending on 7). This says that as h is made smaller, the er- 
ror between the approximation and the true solution can be guaranteed to decrease 
over the whole time interval. 

Because the Euler method is relatively simple, as / is made smaller the error 
decreases fairly slowly; halving h will only halve the error. Numerical methods 
that are used in practice generally have much better error properties. One popular 
method is the Runge—Kutta scheme, where 


h 
Xn+1 = Xn + acd + 2fo+2f3+ fa) 


with fi,..., f4 given by 


fi = fn, tn) 

fo = F (Xn + SAP. tn + 5h) 
fs =f (tn + Zhfe tm + 3h) 
fa= f Gat hfsety +4). 


Although this method appears much more complicated, implementing such a 
scheme computationally is fairly straightforward, and now the error satisfies 


max |X(tp) — x(ty)| < Kh’. 
Stn<T 


This means that halving the timestep will increase the accuracy of the method by 
a factor of 16. MATLAB’s ode45 routine uses a refined version of this method. 


Exercises 


21.1 Apply Euler’s method to the general linear equation x = Ax. Find the approximation 
Xn, and using (21.6) show that as h — 0 the numerical solution converges to the true 
solution. 

21.2 There are variants of the Euler method that have the advantage of better stability prop- 
erties, but have the disadvantage of no longer being explicit schemes. For example, 
the backwards Euler method is 


Xnt+l = Xn + hf (Xn41, tnt), 
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21.3 


21.4 


21.5 
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which has to be solved at each stage to find x,+1 in terms of x,. Apply this method 
to the linear equation x = x, and show that once again the method converges to the 
true solution x(t) = e! as tf > oo. 

Another variant of the standard Euler method is the trapezoidal Euler method. If x(t) 
is the solution of x = f(x, t) then we have 


t+h 
x(t +h) =xo+ f Ff (x(s), 5) ds. 
t 
Use the trapezium rule to approximate the integral to derive this scheme, 


Xn+1 = Xn + hls f On, tn) + sf Ont, tn+1)]. 


(T) Since (d/dx)e* = e*, if we calculate the derivative of e* at x = 0 as a limit it 
follows that 


By rearranging this (note that 1 = limy-_,9 1) show that 
e= lim(1+a)!/", 
h->0 


and hence that 


(Hint: if 
li h) = ii h)=y, 
Him i Sle By 
and « (x) is continuous at x = y, then 
li h)) = li h)}. 
im «LF (A)] = lim x[e()] 


You will need to use this once for each step.) 
(T) In this question we suppose that f satisfies the Lipschitz condition 


lf) — fQ)| < Llx — y| 


and consider the Euler 6-method for approximating solutions of x = f(x), 


Xnt1 = Xn + hl — 6) fn) + Of n+). 


For 6 = O this is the standard Euler method; for 6 = 5 this is the trapezoidal method; 
and for 9 = | this is the ‘backwards Euler’ method. Since x, is not given explicitly 
as a function of x,41, we need a reliable way of calculating it numerically. 
(i) The first thing we must check is that there is a unique solution for x,_,. Suppose 
that 


y=Xn +h[ — A) fan) + Of] and 
Z= Xn t+h[1 — A) fn) + OF @)I, 
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i.e. that both y and z satisfy the equation. By subtracting these two equations 
show that 


y—z=6h[f(y) — f@I, 
and hence deduce that 
ly—z| <ALOly — zl, 


and therefore that y = z provided that h < 1/L0. 
(ii) Suppose therefore that h < 1/L6@. Given an initial guess yo for x,+1, we can 
refine this guess successively by setting 


Yj = Xn +h — 6) fn) + OF pI (E21.1) 
if yj41 = yj = y then 
y=x% +All — 8) fn) + Of) 
and so y would be the required value for x,+1. Show that 
Iyj+1 — yjl S ALOly; — yj-il, (E21.2) 


and hence that successive values of y; are closer together. Thus, for large j, we 
would expect that y;+1 © y; and that y; is a good approximation to x,41. 
(iii) Still assuming that h < 1/L@, use (E21.2) to show that 


lyj41 — yl < (ALO) y1 — yol, 


and hence that 


| = os | | 

Yi — Yel = Tap g!¥1 — Yo 
for any j,k > J. 

It follows that {y;} is a Cauchy sequence, and so converges to a limit y. Taking limits 


as j — oo on both sides of (E21.1) we get 
y=xt sl fn) + SOV. 


and thus x41 = y. 

21.6 (C) For a number of values of t and h compare the exact solution of x = x with the 
solution from Euler’s method, and verify the error estimate in (21.7). 

21.7 (C) Implement the backwards Euler scheme of Exercise 21.2 numerically, and apply 
it to the equation x = x(1 — x) to find the solution when x (0) = 5 forO <t<8.In 
order to find x,+41 given x, you can use the approach of Exercise 21.5, and iterate 


Skt = Xn thgx(1 — gx) 


to give a succession of ‘guesses’ gx for x,+1 until gg appears to stabilise (e.g. un- 
til |gx41 — ge| < A>). You will need to choose h carefully to ensure that your se- 
quence of guesses converges. (Can you work out, using the theoretical results of 


212 21 Euler’s method 


Exercise 21.5, what value of h should suffice?) The MATLAB M-file backeuler.m, 
implementing this scheme, can be downloaded from the web. 

21.8 (C) Write a MATLAB program to implement the Runge-Kutta method introduced at 
the end of the chapter. Apply this method to x = t — x? when h = 0.5, and compare 
this to the solution obtained using Euler’s method with the same timestep. (You can 
download the MATLAB M-file rungekutta.m from the web if you wish.) 
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Difference equations 


It is not only for numerical approximations that it is more appropriate to have an 
independent variable that only takes discrete values. For example, an experiment 
may take measurements at equally spaced time intervals, we may be interested in 
the size of a population in successive generations, or perhaps we want to compare 
the value of the Financial Times index of the top one hundred UK shares (the 
FTSE) at the end of trading on a sequence of successive days (see Figure 22.1). In 
all these cases, it is much more natural to have a dependent variable indexed by n, 
where n is an integer (x,,), than a continuous function of f. 

An equation that relates the values of x, for different values of n is called a 
difference equation, and the order of a difference equation is the largest difference 
between any two of the indices (attached to x) occurring in the equation, i.e. Euler’s 
method 

Xnt1 = Xn thf Xn, nh) 
is a first order difference equation, as is 
_ gnt7 


Xn+1 + COS Xn 


(the n + 7 is not an index of x), while 
Xn+2 = Cate — €xp(Xn-3) 


is fifth order. We have already seen some examples of difference equations in 
the recurrence relations for the coefficients in our power series expansions in 
Chapter 20. 


22.1 First order difference equations 


A first order difference equation relates the ‘next’ value of x and its current value, 


F(Xn, Xn+1; n) = 0. 
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Fig. 22.1. The FTSE index during the period of the second Gulf War, from the 
start of hostilities on the evening of 19 March 2003 until the fall of Tikrit on 
14 April 2003. The data consists of the values marked by crosses; these are joined 
by a dotted line to make the trends easier to follow. 


We will only consider here! equations in which x,41 can be given explicitly in 
terms of x, 


Xnt1 = fn, 2). 


For such equations we do not need a theorem to show us that there is a unique 
solution, since we have a fixed rule by which we can construct the solution. In 
this chapter we consider only linear difference equations, and turn to nonlinear 
difference equations in the next chapter. 

If we consider a simple linear difference equation 


Xn+1 = kXn 
and suppose that we know xo then it is simple to find x, for any n. We have 


x1 = kxo, 
x2 = kx, = k(kxo) = k’x0, 
x3 = kx = k(k*xo) = k*xo, 


and it is easy to see that in general x, = k” xo (we did something very similar in 
Section 21.4 of the previous chapter). 

The analysis of second order linear difference equations that we will now pursue 
is entirely analogous to that of second order linear differential equations, except 


! There are very natural difference equations for which this in not the case. For example, the ‘backwards Euler 
method’, which has nicer properties than the Euler method introduced in the previous chapter, is 


Xnt1 =4Xn thf n+). 


This ‘implicit’ scheme has to be solved for x,4 1 at every timestep as in Exercise 21.5. 
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that the exponential function x(t) = e*! (the solution of x = kx) that we used for 
differential equations is now replaced by x, = k” (the solution of x;41) = kxy). 


22.2 Second order difference equations: complementary function 
and particular solution 


We will concentrate now on linear second order difference equations, 
Xn42 + AXn41 + bx%y = fn, (22.1) 


although the techniques that we will introduce extend easily to higher orders. 

Just as we could split the problem of solving a linear second order differential 
equation into finding the complementary function and then a particular integral, we 
can split the problem of solving (22.1) into finding a solution of the corresponding 
homogeneous problem 


Ynt2 + Ayn41 + byn = 0, (22.2) 


and then of finding one particular solution x, of (22.1). 


22.3 The homogeneous equation 
First we deal with the homogeneous equation, 
AXn+42 + bxXn+1 +X = 0. (22.3) 


We guess that the solution is of the form x, = k”, just as we guessed a solution of 
the form x(t) = e*! for the second order differential equation ax + bx +cx = 0. 
Trying x, = k” in (22.3) we get 


ak"*? + bk" +! + ck" =0, 
and cancelling k” then gives the auxiliary equation 
ak? + bk +c =0. 


We obtain a quadratic equation for k, and the form of solution for (22.2) will 
depend on the nature of its roots. 


22.3.1 Distinct real roots 


If the auxiliary equation has two distinct real roots ky and kz then x, =k} and 
Xn =k are both solutions of (22.3), and so the general solution is 


Xn = Ak” + BS, 
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Particular values of x9 and x, will enable us to determine the constants A and B. 
As an example, we find an expression for the nth Fibonacci number. These are 
the numbers that satisfy 


Xn = Xn-1 + Xn-2; (22.4) 
and start with x9 = 1 and x; = 1. The first few are 
112 3 5 8 13 21 34 55 


These numbers crop up frequently in nature, and have fascinated artists for hun- 
dreds of years. A modern example is shown in Figure 22.2. 


Fig. 22.2. Mario Merz’ ‘Fibonacci Sequence 1-55’ on the chimney of the power 
station in Turku, Finland. Photograph by Dr Ching-Kuang Shene of Michigan 
Technological University, and reproduced with his kind permission. 
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To solve (22.4) we try x, = k”, and find 


k7=k+1. 


This equation has roots 


so that the general solution of (22.4) is 


wna 2°)" 4 0(%)" 


2 2 


The initial conditions require 
a+B=1 (4+V5)a+(1—V5)6 =2, 
and thus, solving for @ and 6, 


1+ 5 V5—1 
a= = 4 
2/5 
The nth Fibonacci number is therefore given by 
1 aay” 1 ofy" 
V5\ 2 V5\ 2 , 


It is somewhat surprising that this formula always gives an integer. Since the sec- 
ond term is always smaller than 1/ J5 & 0.4472, we can also write 


[Cy] 


where [[x]] denotes the nearest integer to x. 


Xn = 


(22.5) 


22.3.2 Repeated roots 


If the auxiliary equation has a repeated real root k, then we have a similar problem 
to the one we had when considering second order differential equations, since 
we will only obtain a single solution x, = k”. Thankfully the resolution of this 
difficulty is also similar: we introduce the extra factor of n where before we had 
an extra factor of t. The general solution is therefore 


Xn = Ak” + Bnk”. 
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To check that this second solution is correct we will substitute it into 
Xn42 — 2kxns1 + k?x_ = 0, 


since any second order linear difference equation whose auxiliary equation has k as 
a repeated root can be rewritten in this form (see Exercise 22.2). Trying x, = nk” 
in the left-hand side gives 


(n + 2)k"*? — 2k(n t+ Dt 4 Penk” = (n +2 -2(E04 1) + n)k"t? =0 
as required. 
Example 22.1 Find the general solution of 
Xn — 4Xn—1 + 4xy,_-2 = 0. 
Trying x, = k” yields the auxiliary equation 
k°—4k+4=0, 


so that k = 2 ‘twice’. The general solution is therefore 


Xn = a2" + Bn2”. 


22.3.3 Complex roots 


We can also have complex roots, for which things are a little more involved. If we 
have k = a ib, then we need to write k in modulus and argument form, 


where 
r=aet+h> and 6 =tan !(b/a) 
(see Appendix A). Then the solution is 
Xn =r"[Acosné + B sinné]. 
To see that this solution is consistent with our guess x, = k”, first write 
xX, = Clre}" + C*[re79]", 


where we take the coefficients to be C = a + if and C* (its complex conjugate) 
to ensure that x, is real. This gives 


Xn = r™[Cel”? + Cre), 
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Since z* + z = 2Re(z), and 


Cel” = (a + if)(cosné + isinné) 
= (acosné — Bsinné) +i(B cosné +a sinng), 


we get 
Xn =r"[Acosné + B sinné] 
if we take A = 2a and B = —28. 
Example 22.2 Find the general solution of the difference equation 
Xn42 — 2Xn41 + 2x, = 0. 


Trying x, =k” gives the quadratic equation 


k* —2k+2=0, 
and so 
24/4 - 
pS eS ie 
2, 
Since 


1ti= V2e%'"/4 


the solution is 


Xn = 2"/*[A cos(nt/4) + B sin(nw/4)]. 


22.4 Particular solutions 


When we have an equation with a non-zero right-hand side, 
AXn42 + bxXn41 + Xn = fn, 


we use the same method that we had for differential equations, i.e. we guess the 
form of the particular solution and then substitute in to determine the constants in 
our guess. 


22.4.1 Right-hand side f,, is a polynomial in n 


When the right-hand side is a polynomial depending on n the appropriate ‘guess’ 
for a particular solution is a general polynomial of the same order as the right-hand 
side. If our guess solves the homogeneous problem then we have to multiply by an 
additional factor of n. 
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We start with a simple first order example. 
Example 22.3 Find the general solution of 
Xn41 = kxy +a. 
The solution of the homogeneous equation 
Ynti =kyn, 
iS Y, = Ak”. To find a particular solution we can try x, = c and then we require 
c=kc+a, 


so we take c = a/(1 — k) and obtain the general solution 


Xn, = Ak" +a/(1—k). 
Example 22.4 Find the general solution of 
Xn — Xn—1 — OXy,-2 = —36n. 


First we solve the homogeneous equation yy, — yy—1 — 6¥p—2 = O by trying 
Yn = k”; we need k to solve the equation 


k?—-k-6=0 > (k — 3)(k +2) =0, 


and so the complementary function is y, = A3” + B(—2)”. For the particular so- 
lution we try a general first order polynomial in n, x, = an + 6. Substituting in 
we get 


an + B — (a(n — 1) + B) — 6(a(n — 2) + B) = —6an + 13a — 6B. 


So we need a = 6 and 6 = 13 which gives the particular solution x, = 6n + 13; 
the general solution is therefore 


Xn = A3” + B(—2)”" + 6n + 13. 


Example 22.5 Find the general solution of 
Xnt1 — 2Xn + Xy-1 = 8. 


To solve the homogeneous equation y,4+1 — 2y, + yn—1 = Owe try y, = k” and 
obtain the auxiliary equation 


k?—2k+1=0. 
This equation has the repeated root k = 1, and so the complementary function is 


yn = A+ Bn. 
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We cannot try x, = c for our particular solution (A is part of the complementary 
function), nor can we try x, = cn (since Bn is also part of the complementary 
function), so we have to try x, = cn*. Then we need 


c(n + 1)? —2cn* +. c(n — 1)? =c[n? + 2n + 1 — 2n? +n? — 2n 4+ 1] = 2c =8, 


ie. c = 4. Soa particular solution is x, = 4n7, and the general solution is 


Xn =4n*> + A+ Bn. 


22.4.2 Right-hand side f, = 2" 


This case is similar to having an exponential on the right-hand side of a differential 
equation. If 4 is not a solution of the auxiliary equation we try x, = aX”; if A is 
a non-repeated root of the auxiliary equation we try x, = anx", while if A is a 
repeated root we have to try x, = an7A". 


Example 22.6 Find the general solution of the equation 
Xn42 + Xnt1 — 6X, = 12(—2)”. 


To find the solution of the homogeneous equation yj+2 + yn+1 — Oy, = 0 we 
try y, = k” and obtain the auxiliary equation 


e+k-6=0 => (k+3)(kK-2)=0, 
so that k = 2 or k = —3 and the complementary function is 
Yn = A2” + B(—3)". 


Since (—2)” is not a solution of the homogeneous equation we can try x, = 
a(—2)"” for a particular solution; we need 


2 ho) = 6a 2" = 129)". 
Cancelling a factor of (—2)” we require 
(—2)a + (—2)a — 6a = 12, 
or a = —3. Soa particular solution is x, = —3(—2)”, and the general solution is 
Xn = A2” + B(—3)” — 3(-2)". C 
Example 22.7 Find the general solution of the equation 
Xn42 + Xn41 — 6X, = 30 x 2”. 


We found the complementary function above, y, = A2” + B(—3)”. Since the 
right-hand side occurs in the complementary function we have to try x, = an2” 
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for a particular integral. Substituting in we need 

a(n + 2)2"*? + a(n + 1)2” — 6an2” = 30 x 2”, 
or, cancelling a factor of 2”, 

a[4(n+2)+2(74+1)—6n]=100 = _ 30. 


So a = 3; a particular solution is x, = 8n2”, and the general solution is 


Xn = A2” + B(—3)” + 3n2". 


Exercises 


22.1 Find the solutions of the following difference equations satisfying the given initial 
conditions. 
(i) Xn42 — 44n41 + 3X, = 0 with x9 = 0 and x; = 1; 
(ii) 2X)41 — 3x, — 2x,-1 = 0 with x; = x2 = 1; 
(ili) Xp42 = 2Xp41 — 2X, With xo = | and x; = 2; 
(iV) Xn42 + 6Xy41 + 9X, = 0 with x9 = 1 and x; = 6; 
(Vv) 2x, = 3xXn-1 — Xn—2 With x9 = 3 and x1 = 2; and 
(Vi) Xn42 — 2Xn41 + 5xXn = 0 with x9 = V5 and x; = 5 cos tan! 2. 
22.2 Show that if the auxiliary equation 


ak? +bk+c=0 
has a repeated root k = A then the difference equation 
AXn+2 + bxn41 + CX = 0 
can be rewritten in the form 
Xn42 — 2AXn41 +A7Xy = 0. 


22.3 The ‘golden ratio’ is the ratio (greater than one) of the sides of a rectangle with the 
following property: remove a square whose sides are the length of the shorter side of 
the rectangle, and the remaining rectangle is similar to the original one (its sides are in 
the same ratio), see Figure 22.3. This ratio was used by the Greeks in constructing the 
Parthenon (among many other monuments), and has been a favourite tool of artists 
ever since. 

Suppose that {x,,} is a sequence of numbers satisfying the recurrence relation 


Xn+2 = Xn+1 + Xn- 


Show that if all the elements of the sequence are integers then the ratio of consecutive 
terms, Xn+1/Xn, converges to the golden ratio. Show that the same result is true if 
all the terms in the sequence have the same sign. (In particular this is true for the 
Fibonacci numbers, which have x9 = 0 and x; = 1.) 
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Remove 
this 
square 


Fig. 22.3. The golden rectangle. 


22.4 Find the general solution of the following difference equations, and then find the 
solution that satisfies the specified initial conditions. 
(i) Xn42 — 4X, = 27n?, with xo = 1 and x; = 3; 
(i) Xn41 — 4X, + 3x, -1 = 36n?, with x9 = 12 and x; = 0; 
(ii) X41 — 4%, + 3x_,_-1 = 2”, with x9 = —4 and x; = —6; 
(IV) Xn41 — 4%) + 3xXp,-1 = 3”, with x9 = 2 and x; = 13/2; 
(V) Xn42 — 2Xn41 +X, = 1, with x9 = 3 and x; = 6; 
(Vi) Xn42 + Xn = 2”, with x9 = x; = 0; 
(Vii) Xn42 + Xn+1 + Xn = c (the general solution is enough here). 
22.5 Find the solution of the difference equation 


Xnt1 = Xn(1 + Xn) 


with x; = 1. Now show that if x; = c then 
I (c+n) 
“Te+D 
where the I’ function, which was defined in Exercise 20.8, satisfies T(x + 1) = 
x(x). 


xn = 


23 


Nonlinear first order difference equations 


In general the solutions of a nonlinear difference equation 


Xnt1 = fn) (23.1) 


can have very complicated behaviour. To find the solution of such a difference 
equation we have to iterate (apply repeatedly) the map f (‘map’ is just another 
word for function, and is frequently used in this context). The solution is given by 
the sequence of iterates 


x0, x= fo), x=f(FGo), 3 =fFFC0))), --- 


called the ‘orbit’ of xo. Since these nested fs rapidly become unmanageable we 
adopt the notation f”(x) to mean f applied n times to x, 


f"@) =fFSFC--£@)--). 
ae ea 
n times 


We can write the ‘solution’ of (23.1) that has x9 = yo as x, = f” (yo), but this is 
clearly no more descriptive of the solution than (23.1) itself. 


23.1 Fixed points and stability 


In order to describe the dynamics of solutions we make use of similar concepts as 
we used in Chapter 7 for the one-dimensional dynamical systems that arise from 
autonomous differential equations. In particular it is often useful to concentrate on 
what happens to solutions ‘eventually’. 

For an iterated map such as (23.1) a fixed point is a point x* such that 


gb ea 


so that if x, = x* then x,4,; = x*. (The fixed points of x,41; = f(x») are analo- 
gous to the ‘stationary points’ of x = f(x).) 


224 
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We describe a fixed point as stable if you stay close to it provided that you start 
sufficiently near: x* is stable if for any € > 0 there exists a 6 > O such that 


Ixo —x*| <6 > FG) = 2" | ee: forall, ge '0,; 132.9 
SS a 
start near stay near 


Again, we can also introduce the related but distinct concept of being attracting 
(‘start near tend to’): x* is attracting if there is a 6 > O such that 


Ixo —x*| <6 => f"(x) > x* as n>oo. 
a ee 
start close enough tend to 


Negating ‘stable’ we get unstable: x* is unstable if there exists an € such that no 
matter how small we make 6, we can find an xg with 


Ixo —x*| <6 but |f"(xo) —x*| >e forsomen > 0. 


In order to discover analytically whether or not a fixed point is stable, suppose 
that x, = x* + 6, where 64, is small; then, using a Taylor series expansion about 
* 
ee 


Xn41 = f(x* + bn) 
f(x") + f(x" )bn 
= =F f QO on 
So if we write x,41 = x* + 6,41 we have 
Snt1 © f'(x*) bn. (23.2) 


The solution of (23.2) is 5, = [f’(x*)]"5q, so it is clear that successive values 
of 5; will decrease if | f’(x*)| < 1, and increase if | f’(x*)| > 1. So x* is stable 
and attracting if | f’(x*)| < 1, and unstable if | f’(x*)| > 1. (This can be made 
rigorous, see Exercise 23.4.) 


23.2 Cobweb diagrams 


In order to work out what happens when we iterate f we can use a graphical 
method which produces a picture known as a ‘cobweb diagram’. First we draw 
the graph of f(x) against x, and then the graph of y = x. Now, given x, we can 
find x41 by drawing a line up to f(x,), then over to y = x, and back down to the 
x-axis to give X41. To find x,42 we do the same, but starting at x,+1 — we start 
now from the point on the diagonal, rather than the point on the x-axis, since this 
simplifies the picture — and so on. This is illustrated in Figure 23.1, for the example 


f(x) =3xd0 —x). 
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1 T T 


Fig. 23.1. The first few steps of the ‘cobweb’ method for x41; = 3x,(1 — xn). 


23.3 Periodic orbits 


For any choice of xo, the linear difference equation 


Xn+1 = —Xn 


generates an orbit that flips between the two values x9 and —xg. This is a simple 
example of a periodic orbit of period 2, or more concisely a period 2 orbit. In 
general if x,41 = f (xn) then a periodic orbit of period 2 is a pair of values x; and 
x2 such that 


Ff (x1) = x2 and F (x2) = x1, 


so that f7(x;) = x1, see Figure 23.2. 
A periodic orbit of period k (or a period k orbit) is a sequence of k values 
{x1,..., Xx} such that 


Sf (xj) =Xj41 for jz=l,...,n—-1 and fQ@n) = x1, 


so that iterates of x; cycle around these k values for ever. (Strictly we also need to 
make sure that f(x;) # x; for j = 1,...,n — 1, so that k is the ‘minimal period’ 
of the orbit.) We will see several examples of such periodic orbits in the next 
chapter. 
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Fig. 23.2. A period 2 orbit for the map x,4; = (1+ /5)Xn(1 — xp). On the left 
is the cobweb diagram, while the right-hand picture shows successive values of 
Xp, against n. 


23.4 Euler’s method for autonomous equations 
In the remainder of this chapter we apply some of the above ideas in order to 
understand how well the qualitative behaviour of the differential equation 


dx /dt = f (x) (23.3) 


(which we studied in detail in Chapter 7) can be captured using the numerical 
Euler method introduced in Chapter 21. Recall that the stationary points x* of 
(23.3) occur when f (x*) = 0, and that they are stable if f’(x*) < 0 and unstable 
if f’(x*) > 0. 

If we apply the Euler method to (23.3) with a timestep / then we have 


Xnt1 = Xn thf On), (23.4) 


where x, iS an approximation to x(t,) with t, = nh. We will write g(x) =x + 
hf (x), so that (23.4) can be written more concisely as 


Xnt1 = 8(n). 
The fixed points of the map g are those x values x* for which x* = g(x*), 
i.e. for which 
x*=x* +hf(x*). 


This means that the fixed points of g occur when f (x*) = 0, i.e. they are the same 
as the stationary points of the differential equation (23.3). 

To determine the stability of the fixed points we have to consider the modulus 
of 


g(x) =1+hf'(x). 


228 23 Nonlinear first order difference equations 


A fixed point at x* will be unstable whenever 
Ig'(x*)| > 1. 


This happens when (i) f’(x*) > 0, or (ii) f’(x*) < O andh > 1/|f’(x*)|. In case 
(i) the point x* is an unstable stationary point for the differential equation, but case 
(ii) says that although x* is a stable stationary point for the differential equation, 
the timestep h is too large to reproduce this stability property in the numerical 
method. 

On the other hand, a fixed point at x* is stable provided that 


Ig'(x*)| <1 


which happens when f’(x*) <0 and h < 1/| f’(x*)|. So x* has to be a stable 
stationary point of the differential equation and the timestep h has to be sufficiently 
small. 

It follows that the stationary points of a differential equation such as (23.3), 
along with their stability properties, will be reproduced correctly by the Euler 
method provided that / is small enough so thath < 1/|f(x*)| for every stationary 
point x*. 

We now look at these phenomena as they occur in a particular example. 


23.4.1 An example 
We will apply Euler’s method to the equation 


dx/dt=x(k—x’*) with k>0, (23.5) 
which gives 
Xnt1 = Xn + hxn(k — x2). (23.6) 
We will write f(x) = x(k — x) and a(x) =xt+hx(k -— x’). 
For k > O equation (23.5) has an unstable stationary point at the origin and 


stable stationary points at x = +/k, as shown in the phase diagram in Figure 23.3 
(cf. Figure 7.13). 


Fig. 23.3. Phase diagram for equation (23.5). 
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of: 
0 1 2 3 4 5 0 1 2 3 4 ts] 


Fig. 23.4. Euler’s method applied to + = x(16 — x”) with h = 0.1 and initial 
conditions x9 = 4 and x9 = 5. The timestep is too large to preserve the correct 
stability properties of the point x = 4. [In this and the following three figures, the 
left-hand side shows successive values of x, against n, while the right-hand side 
shows the cobweb diagram of the same orbit.] 


The Euler scheme has fixed points at 0, —/k, and /k, and their stability is 
determined by the size of 


lg’(x)| = |L + hk — 3x7). 
Since we have 
le’(O)| =1+kh and — |g'(+vk)| =|1 —2hkl, 


the fixed point at the origin is always unstable, while the fixed points at x = +/k 
are stable if h < 1/k and unstable if h > 1/k. 

We now fix k = 16 and choose three values for h, concentrating on the be- 
haviour of positive solutions. First, if h = 0.1 then h = 0.1 > 1/16 = 1/k, and 
the fixed point at x = 4 is unstable. The successive values of x,, along with the 
cobweb diagram, are shown in Figure 23.4. 

If we decrease h a little to h = 0.065, then it is still greater than 1/16. Fig- 
ure 23.5 shows that the orbit has settled down to a period 2 orbit rather than to the 
fixed point. 

If we now reduce h so that it is less than 1/16 then we would expect x = 4 to be- 
come stable. Figure 23.6 shows the results of the method with h = 0.06. Although 
the fixed point at x = 4 is now stable, the sequence x, oscillates about x = 4 as it 
converges. This is in contrast to the behaviour of the differential equation, where 
x(t) would approach x = 4 monotonically. 

Finally, Figure 23.7 shows the result of the method when h = 0.01. The timestep 
is now small enough so that the Euler method yields a very good approximation to 
the solution of the ordinary differential equation. 
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Fig. 23.5. Euler’s method applied to + = x(16 — x”) with h = 0.065. The solu- 
tion has ended up switching between two values of x on a period 2 orbit. 


0 2 4 6 8 
Fig. 23.6. Euler’s method applied to « = x(16 — x*) with h = 0.06 and initial 


conditions x9 = 4 and xo = 5. The timestep is small enough so that x = 4 is sta- 
ble, but solutions oscillate as they approach this fixed point. 


Exercises 


23.1 Show that there is an orbit of period 3 containing the point x = 1 for the difference 
equation 


2 
Xn+1 = Xn Xn + 


23.2 Suppose that the differential equation x = f(x) has a stationary point x* where 
f'(x*) < 0. We saw that the point x* is a stable fixed point for 


Xn41 = Xn +hf (xn), 
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0 5 10 15 20 eo 0 1 2 3 4 5 


Fig. 23.7. Euler’s method applied to x = x(16 — x*) with A = 0.01 and initial 
conditions x9 = 1, x9 = 4 and x9 = 5. The timestep is small enough so that the 
method approximates the solution of the differential equation very well. (Two 
orbits in addition to the fixed point are now shown in both diagrams.) 


provided that h < 1/| f’(x*)|. Assuming that xo is sufficiently close to x*, show that 
if h > 1/(2|f’(x*)|) then x, is alternately greater than and less than x*, while if 
h < 1/(2|f’(x*)|) the orbit x, approaches x* monotonically. 

In this question we consider the trapezoidal Euler method 


Xnt1 = Xn + ZALS On) + f Ong]. 


Show that x»41 = x, = x* if and only if f(x*) = 0, ie. that the fixed points of the 
numerical scheme coincide with the stationary points of the differential equation x = 
f(x). 

Using the chain rule show that 


dxn+1 1+ shf' (Xn) 


din 1 — hf! (n+) 


and hence that a fixed point x* is stable if f’(x*) < 0 and unstable if f’(x*) > 0, 
i.e. that whatever the timestep the stability coincides with that of the corresponding 
stationary point in the differential equation. 

(T) It follows from the definition of the derivative that 


FO" +h) = fx*) + fh + off), 


where o(h) indicates that the remainder terms satisfy 


o(h) 
— > 
h 


In particular, given € > 0 there exists a 6 > 0 such that 


0 as h— 0. 


lo(h)| < €h 
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for all |h| < 6. Use this to show rigorously that a fixed point x* of xn41 = f(%n) is 
stable if | f’(x*)| < 1 and unstable if | f’(x*)| > 1. (Recall that a fixed point x* is 
stable if given an € > 0 there exists a 6 > 0 such that whenever |xg — x*| < 6 we 
have | f" (xo) — x*| < € for alln = 0, 1,.... In fact you should be able to show that 
when | f’(x*)| < 1 the fixed point is attracting.) 
(T) Suppose that f has a periodic orbit of period k consisting of the points 
{x1,..., Xz}. Show that each of the points on the orbit is a fixed point for the map 
Be SF"): 

A periodic orbit is said to be stable if each point on the orbit is a stable fixed 
point of f*. Show that a periodic orbit {x1, x2} of period 2 is stable provided that 


lf) f'(x2)| < 1, 
and that a periodic orbit {x,,..., xx} of period k is stable provided that 
If’ Ca) f (x2) +++ fOr fx) < I. 


Note in particular that if one point on the orbit is a stable fixed point of f* then so 
are all the others. 
(T) Consider the iterated map 


Yat+l = f On) =1Yn + ye 


for r < 0. Find the two fixed points, and show that the fixed point at y = 0 is stable 
for —1 <r < Oand unstable forr < —1. 
Show that if y lies on an orbit of period 2 then 


ytr+Dyt++)=0, 


and deduce that there is a period 2 orbit if r < —1. Hint: we must have f(y) = y, 
and you can factorise the resulting equation since f(0) =Oand f(l —r) =1-r. 
If y; and y2 are the points on this orbit, show that 


f' Of’ 02) =4+42r—r?, 


and hence that this orbit is stable for 1 — /6 <r < —1. 

(C) Apply Euler’s method with timestep h to the equation x = x(k — x) (cf. Exer- 
cise 7.6). Investigate how the stability of the fixed points depends on k and h. Now 
implement this Euler scheme numerically and verify your results (e.g. compare the 
cases k = | and k = 3 with timestep 4 = 1). (You could adapt the MATLAB M-file 
euler .m, which is available on the web.) 
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The logistic map 


In this chapter we consider a particular example of a nonlinear difference equation, 
the logistic map 


Xn+1 = 71XnC — xn). (24.1) 


Despite its simplicity the orbits of this equation can be extremely complicated, and 
this has made it one of the standard models in the theory of dynamical systems and 
chaos. 

We will vary the parameter r between 0 and 4, since then if x, lies between 
zero and one, so does x,+1 (the maximum value of rx(1 — x) occurs when x = 5 
and is r/4). The graphs of f(x) = rx(1 — x) are shown in Figure 24.1 for various 
values of r. 

The equation can be thought of as a discrete model of a population (we saw a 
very similar differential equation earlier) with limited resources. If the population 
is small (measured, presumably, in thousands or millions to give an approximately 
continuous variable) then (24.1) predicts that its size will increase, since when 
Xn © 0, 


Xn4+1 ~~ Xn. 


But when x, approaches the maximum size sustainable by the resources available 
(which here is 1), the population dies out rapidly; if x, = 1 — y, with y, ~ 0 then 


Xn+1 ~TYn, 


so that most of the population dies out. 

What one would naively expect from such a model is that the population size 
would settle down to some steady state (as in the solution of the differential equa- 
tion x = rx(1 — x)), or perhaps oscillate between two nearby states, and indeed 
this does happen when r is small. However, as r increases towards 4 this intuition 
can be very wrong. 
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Fig. 24.1. Graphs of rx(1 — x) for (left to right) r = 1, 2, 3 and 4. 


We will base our discussion around the existence of fixed points and periodic 
orbits. However, we will also make use of the idea of the ‘attractor’; rather than 
defining it precisely here, the best way to think of it is as the points around which 
the orbits of x;41 = f(xy) will move ‘eventually’. 


24.1 Fixed points and their stability 


First note that there are at most two fixed points lying between zero and one. Fixed 
points are the solutions of 


x =rx(1—x); 


these are x = 0 and x = 1 — (1/r). Since we want to consider positive popula- 
tions, the non-zero fixed point will only be interesting when r > 1. 

If we calculate f’(x) =r(1 — 2x) then f’(0) =r; the fixed point at x = 0 will 
be stable while r < 1, and unstable once rr > 1. 

If 0 <r <1 then 1 — (1/r) < 0 and so there is no positive fixed point, and 
over time the population decreases to zero (the interpretation of this being that the 
reproductive rate is not high enough to sustain the population). In this case the 
dynamics are very simple; x = 0 is stable and attracting, see Figure 24.2. 

When r > | the origin is no longer stable, and there is another positive fixed 
point. Since 


f'd-(/r) =r -—2+ Q/n))=2-7, 


this fixed point is stable while r < 3. So for 1 <r < 3 all orbits are attracted to 
1 — (1/r), as shown in Figure 24.3. 


24.2 Periodic orbits 


When r increases beyond 3, things become more complicated. The fixed point at 
1 — (1/r) is now unstable, since the derivative of f in that case has modulus larger 
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Fig. 24.2. For 0 <r < 1 (here r = 0.8) the origin is a stable fixed point, and 
the population dies out. This, and all similar figures in this chapter, show the 
cobweb diagram of a representative orbit on the left, and the successive values of 
X, against n for the same orbit on the right. 
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Fig. 24.3. For 1 <r < 3 the origin is unstable, and there is an attracting non-zero 
fixed point. The pictures here are for r = 2.5. 


than 1. If r is just a little larger than 3, almost every choice of initial condition 
(apart from either fixed point) ends up cycling between two different values of x 
on a period 2 orbit, as shown in Figure 24.4. 

In the figure, r = 1 + /5, and one point on the period 2 orbit is x = 5. To see 
that this really does give a period 2 orbit, we first calculate 


fG)=04+-75)40- 3) = 4, 
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Fig. 24.4. For r > 3 there are periodic orbits of period 2. This picture has r = 
1+ J5 © 3.2361. 


and then 


Pa)=f (: 4) = (1+ V5) ( ~) ( - +) 


6+2V75\ (3-V5\ __ 18- 6/5 + 65 — 10 
4 4 J 16 


! 
= 5. 


If we try to find a period 2 orbit analytically for general r then we want to find 
a value of y such that if x, = y then x,42 = y. So we want 


y= f°) 
which is y = f(ry(1 — y)) or 
y=rlryd —y)J[1—ryd — yl. (24.2) 


This is a quartic (fourth order) equation for y; but since we know that y = 0 and 
y = 1-—(1/r) must be solutions (they are fixed points with f(y) = y, so cer- 


tainly f(f(y)) = fQ) = y) we can remove a factor y(y — [1 — (1/r)]). If yisa 
period 2 point it must therefore solve the equation 


ry? —(L+r)y + (1 + *) = 0: oe) 
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Fig. 24.5. As r passes through 3 the interior fixed point becomes unstable, and 
a stable orbit of period 2 appears. The stable fixed point is shown as a solid line, 
the unstable fixed point as a dotted line, and the two points on the stable period 2 
orbit as a dashed curve. 


This equation only has real roots if the discriminant is positive (b* — 4ac > 0), 
Le. if 


ane —4r (142) =(l¢ry(r—3)>0. 


Since we have restricted to the parameter range 0 <r < 4 the factor (1 +r) is 
positive; for a period 2 orbit we must therefore have r > 3. 

Note that the two points on the periodic orbit (the solutions of (24.3)) are given 
by 


(d4+r+VJ/U4+r)(r —-3) 
2r , 
When r = 3 this would give 2/3, which is the position of the positive fixed point. 
Whenr > 3 the fixed point at 1 — (1/r) is unstable, and the two points on the orbit 
“split off’ on either side of the fixed point, see Figure 24.5. 


24.3 The period-doubling cascade 


As r is increased a little further, this attracting periodic orbit of period 2 becomes 
unstable — two points break off from each point on the orbit, and we end up with 
an attracting orbit of period 4, as shown in Figure 24.6. 
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Fig. 24.6. A period 4 orbit when r = 3.5. 


0 0.2 0.4 0.6 0.8 1 0 2 4 6 8 10 
Fig. 24.7. A period 8 orbit when r = 3.55. 


Increasing r tor = 3.55 we obtain an attracting period 8 orbit, as in Figure 24.7. 

When r is a little larger the period of the orbit doubles again to 16, then again 
to 32, then again to 64... This is known as the “period doubling cascade’. The 
parameter values at which these successive period doublings occur get closer and 
closer together, and converge towards a critical parameter value r ~ 3.5701. 


24.4 The bifurcation diagram and more periodic orbits 


When r is increased beyond this critical parameter value the behaviour of solutions 
becomes extremely complicated. One way to try to keep track of it is to draw the 
bifurcation diagram, as shown in Figure 24.8. The horizontal axis represents the 
parameter r, and for each value of r the attracting set is plotted vertically; this is 
done by choosing an initial condition, applying f a large number of times (here 
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Fig. 24.8. The bifurcation diagram for 0 < r < 4. After one hundred iterations, 


the next thirty are plotted vertically for a number of r values. 


0.3 
3.4 3.42 3.44 3.46 3.48 3.5 3.52 3.54 3.56 


r 
Fig. 24.9. The bifurcation diagram for 3.4 <r < 3.6. 


3.58 3.6 


100) until the orbit has ‘settled’ down, and then plotting more points of the orbit 


on the vertical axis. 


The diagram shows the stable fixed point at x = 0 whiler < 1. Forl1 <r <3 
the fixed point at the origin is no longer stable, and the fixed point atx = 1 — (1/r) 
is attracting. For r > 3 this fixed point becomes unstable, and instead all orbits 
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r r 
Fig. 24.10. The portion of the bifurcation diagram in the small box of the left- 


hand figure is magnified in the right-hand figure, and proves strikingly similar to 
the original picture. 


are attracted to a periodic orbit of period 2. Figure 24.9, a magnified version of part 
of the previous figure, shows that at a value of r between 3.4 and 3.5 the period 2 
orbit becomes unstable, and orbits settle down to an orbit of period 4. We can also 
see the first part of the period doubling cascade occurring as r increases, until the 
parameters become too close to distinguish. After the critical value of r ~ 3.5701 
everything becomes extremely complicated. 

However, there is still order. For example, you can see that this diagram is ‘self- 
similar’ by magnifying a small portion and observing that it looks very similar to 
the original diagram, as shown in Figure 24.10. 

Notice also that there are ‘windows’ in which the solution is more regular again, 
for example in Figure 24.11 you can see that for r values around 3.835 there is a 
period 3 orbit; this is shown in Figure 24.12. 

By the same mechanism that produces the orbits of period 4, 8, etc. from the 
initial period 2 orbit, this period 3 orbit will period double to 6, 12, 24, 48, etc. and 
lead to another chaotic region. 


24.5 Chaos 


The phenomenon of chaos consists, essentially, of deterministic motion in which 
the motion appears to be random, see Figure 24.13. 
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Fig. 24.11. The bifurcation diagram for 3.6 <r < 4. 
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Fig. 24.12. A period 3 orbit for r = 3.835. 


A more rigorous defining feature of chaotic systems is that small changes in the 
initial conditions will produce wildly different behaviour if we wait long enough. 
This phenomenon is known as sensitive dependence on initial conditions, and is 
illustrated in Figure 24.14, which shows successive values of x, when r = 4 for 
initial conditions that agree to the fourth decimal place. 

When a model exhibits such sensitive dependence on initial conditions it will be 
of little use for predicting the future, since tiny errors in the initial conditions will 
lead to very different outcomes. However, this pessimistic observation has a more 
optimistic converse; even if the behaviour of a system appears very complicated, 
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Fig. 24.13. A chaotic orbit of x)41 = 4x%,(1 — xn). 
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Fig. 24.14. Successive values of x, vs n when r = 4, for initial conditions x9 = 
0.2189 (top) and x9 = 0.2188 (bottom). For n > 9 the iterates are completely 
different. 


or even random, it may still be subject to a very simple underlying rule. Since 
the advent of chaos theory, much experimental data that was once discarded as 
spurious and useless has been re-analysed and found to contain a high degree of 
order. 


24.6 *Analysis of x41 = 4x,(1 — xn) 
When r = 4 the map is 


Xn+1 = 4Xn(1 — Xn), 


24.6 *Analysis of Xn41 = 4Xn (1 — Xn) 
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Fig. 24.15. The ‘tent map’ obtained from our original map by a substitution. 
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and we can simplify this problem by means of a careful substitution. Since x, € 


[0, 1], we can set x, = sin? 6n, with 6, € [0, 7/2]. The equation for 6, is then 


sin? On+1 = 4 sin? On cos? On 
= sin” (20,). 


Since we want 6 € [0, 7/2], we can take 


On+1 = 


If we rescale 6, setting y, = 26,/7, to give y € [0, 1], we obtain 


Ynt+l = 


26 
7 — 206, 


2Yn 
2(1— yn) 4 


0<0, < 7/4 
7/4 <6, < 7/2. 


j=) 


n 


This new map (the ‘tent map’) is shown in Figure 24.15. 
The easiest way to consider the dynamics of this new map is by writing down 


the ‘binary decimal’ expansion of yy, 


where 


Yn = A0.a1d20304..., 


Ce . 
Yn = paras 
j=0 


Although we appear to have made the problem significantly more complicated this 


way, it will enable us to understand the dynamics much more easily. 
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Doubling the binary decimal 
y = 0.ajaza3a4... 


corresponds to shifting the ‘decimal point’ one place to the right (analogous to 
multiply by 10 for normal decimals), 


2y = a}.d203a4d5.... 


Subtracting y from 2 swaps all the Os and 1s: we will denote the operation of 
swapping 0 and 1 by a bar, 


“ 1 ifa=0O 
a= : 
0 ifa=l, 
and so 
2 — dy.d1 20304... = dy.d1d2A304.... 


In the range of y that we are considering (y € [0, 1]) we always have ap = 0 
provided that we represent 1 by 0.1% (we use the notation (7; ...7r,)°° to mean 
r,...Yp repeated ad infinitum). If y, < 1/2 then we also have a; = 0, and so 


Yn+1 = 0.a2a3d405.... 


If y, > 1/2 then ay = 1, and we have 


Yn+1 = 2- 1.a2a3a4a5 = 0.a24304a5 sated 
Therefore we can rewrite our map as 


: 0.a2a3a4a5... fora; =0 
if = 0.a,a2a3a4... then l= ash, as dU 
a vat 0.a2a3a4a5 ... for Qi = 1. 


With the map written in this way it is possible to understand its dynamics fairly 
easily. 
Suppose that yo is rational. Then just as in base 10, its decimal expansion will 
either be finite, or eventually repeat. If its decimal expansion is finite, 
yo = 0.41... an, 
then y; = 0 for all j =n, and the orbit ends up at zero after a finite number of 
iterations. If the decimal expansion eventually repeats then 
yo = 0.b,b2b3 ... by (ayaz .. .dm)~ 

In this case we will have 

y= 0.bob3babs5 re by (a\a2 or Am) or 0.bob3b4bs ne by (a, a2 oe Gm); 


yo = 0.b3b4b5....by(ayaz...dm)~ or 0.b3b4b5...bn (Gaz... Gm), 
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until after n iterations we have 
Yn = 0.(aja2...Am)~ or 0.(@]a2...dm)™. 


Without loss of generality we can consider what happens when 
Yn = 0.(ajaz...am)™. 

There are now two possibilities. Either f” (y,) = yn, in which case the orbit re- 

peats every m iterations; or all the zeros and ones of y, are interchanged and 

Ft’ On) = Yn. In the second case, since f(y) = f(y) for any y, 


Sf" One) = FF On) = FF" On) = £ On) = £On) = Yat 


and so the orbit repeats every m iterations after just one more application of /f. 
Thus every rational number lies on an orbit that is eventually periodic. 

However, none of these periodic orbits can be stable. Indeed, all initial condi- 
tions that are not equal will eventually separate, falling on different sides of x = 4 


7: 
Suppose that 
yo = 0.d1d2... dn An+1 wie and Z0 = 0.a1d2...dnby+1 Pare 


with dyn41 4 by+1; then the first decimal places of f”(yo) and f”(zo) will not be 
equal. 

If yo is irrational then the decimal expansion will go on for ever and never repeat. 
It follows that the orbit starting at yo will also never repeat, and so cannot be 
periodic. 

We have a strange situation, then. There are no stable orbits, but all rational 
numbers are eventually periodic or reach zero after a finite number of iterations. 
Any number that starts irrational will have a binary expansion, and hence an orbit, 
that never repeats itself; one can also show that the distribution of points along 
such an orbit is effectively random, even though the evolution is deterministic. 

The idea of following an orbit by seeing which side of x = 5 it falls (which is 
what the binary expansion here does very simply) can be generalised into a useful 
technique for analysing the original logistic map for other parameter values. This 
model is now very well understood, and is one of the standard examples used in 
the theory of dynamical systems. 


Exercises 
24.1 Consider the iterated map 


Xn41 = ra _ x): 


Show that for 0 <r < 3/3 /2 if 0 < x, < 1 then 0 < x74) < 1. Show that ifr < 1 


then the only fixed point in [0, 1] is zero, and that this is stable. 


246 


24.2 


24.3 
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Fig. 24.16. On the left is the graph of f and f? (in bold) for r = 3.5. On the right 
is a magnified version of the box in the left-hand figure, showing that f* has a 
fixed point. 


0.8 


0.6 


When r > | there is another fixed point in [0, 1]. Find the value of this fixed point 
(as a function of r). For which values of r is it stable, and for which values is it 
unstable? 

What would you expect to happen when r > 2? 

(C) Use the M-files Llogistic.m (which draws cobweb diagrams), xnvsn.m (which 
plots successive values of x, vs n) and bifurcation.m (which draws the bifur- 
cation diagram for a given range of r) to investigate the dynamics of the logistic 
map. Modify the programs to investigate the dynamics of the map in the previous 
exercise. 

(C and T) The M-file £2.m plots the graph of f(x) and of f 2(x) in the left-hand 
figure, and the graph of f? restricted to the little box in a blown-up version on the 
right (see Figure 24.16). By looking at a succession of pictures as r increases from 
0 to 4, observe that the rescaled version of {7 behaves in the same way that f does 
as r increases. This can be made precise, and explains the period doubling cascade. 
Since the fixed point of f becomes unstable and gives rise to a period 2 orbit, the 
same thing happens to 7; its fixed point (a period 2 orbit for f) becomes unstable 
and gives rise to a period 2 orbit (a period 4 orbit for f). Since whatever happens to 
f happens to f?, whatever happens to f7 happens to (f*)? = f7; its fixed point 
(a period 4 orbit for f) will become unstable and give rise to a period 2 orbit 
(a period 8 orbit for f). Similar reasoning holds for each orbit of period 2‘, showing 
that it becomes unstable and produces an orbit of period 2‘+!. The map formed by 
restricting f* to the little box, and then rescaling to the interval [0, 1], is known as 
the renormalisation of f . You can investigate the dynamics of the renormalised map 
as r changes using the M-file renormalised.m. 
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Coupled linear equations 
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*Vector first order equations and higher order equations 


All the equations that we have considered so far have been first order equations in 
which there was only one dependent variable (e.g. x(t) or y(x), where x and y are 
scalars). If we were restricted to equations in which there is only one dependent 
variable then this would exclude the vast majority of applications: for example, 
specifying the position of something in the three-dimensional space in which we 
live requires three coordinates. 

Although it is much harder to find solution methods for equations involving a 
number of dependent variables, the theoretical ideas are straightforward general- 
isations of what we did for scalar equations in Chapter 6. Here we make precise 
what we mean by a solution, and state the theorem that guarantees the existence 
and uniqueness of solutions under easily checked conditions. 

Suppose that we have n dependent variables x1, ..., x,, and each of these obeys 
a differential equation with the right-hand side (perhaps) depending on some of 
the other variables, 


xy = fi, x2, iva diy bY 
B60) = S21, x2, see Xny t) 

(25.1) 
Xn = tn@1, X2, ° 1.5 Xn,t). 


This is a set of n coupled first-order equations which we can write in a much more 
convenient way if we make use of vector notation. 
We write the n dependent variables x1, ..., x) as a vector x, 


x] 
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and define a vector function f(x, t) by 


Fi@i, «++, %nst) 
Saige ss Rast) 
(KS tei = ot 
fa, ..+,Xn,t) 
With this notation we can rewrite the coupled equations in (25.1) as 

dx 

“ane t f it ’ 

ag ee 


which encourages us to find a theory that unifies the treatment of scalar equations 
and coupled equations. On the theoretical level this is possible, and here we give a 
general existence and uniqueness result which reduces to the scalar existence and 
uniqueness theorem (Theorem 6.2) when n = 1. 

First we define a solution of the appropriate initial value problem, cf. Defini- 
tion 6.1. 


Definition 25.1 A solution of the initial value problem 
dx : 
a7 = f(x, ft) with x(to) = Xo xe R’, (25.2) 


onan open interval I that contains to is a differentiable function x : I > R", with 
X(to) = Xo and x(t) = f(x, t) forallt € I. 


We now have essentially the same existence and uniqueness theorem as before 
(Theorem 6.2). We use the notation Df to denote the matrix of partial derivatives 
of f, 


afi /Ox1 +++ Of1/OXn 
Df = : oe : (25.3) 
Ofn/OX1 +++ Ofn/OXn 


Theorem 25.2 Jf f(x, t) and Df(x, t) are continuous functions of x (i.e. of x1, 
X2,...,Xy) and t for 


x € U = (a1, b1) x (a2, b2) Xr XK (an, bn), 


and for c < t <d then for any Xo € U and tg € (c,d) the equation (25.2) has a 
unique solution on some open interval containing to. 


Note that the only real change is that we have to check that all the partial deriva- 
tives Of; /dx; are continuous, rather than just the one derivative we get in the scalar 
case. You should also check that you are happy that the above definition and theo- 
rem reduce to the scalar case when n = 1. 
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We will spend some time with pairs of coupled first order equations (n = 2) 
in Chapters 26-36, and consider a particular example of three coupled equations 
(n = 3) in Chapter 37. 


25.1 Existence and uniqueness for second order equations 


Theorem 25.2 can also be used to show that second (or higher) order scalar equa- 
tions have unique solutions. If we consider, for example, the second order equation 


xX = f(x, x,t), 


then we recast this as a vector equation by defining another variable representing 
x. We let x; = x and x7 = x, so that! x = (x1, x2) = (x, x), and then 


xX, =xX=xX2 
; dx. , 
tg a Sk = FED = fO2, md, 


which we can rewrite as 
x 
— = g(x, f), (25.4) 


with 


= x2 
BS 0:= Ca - 


Existence and uniqueness are guaranteed by Theorem 25.2 for equation (25.4) as 
long as we provide an initial condition x(t9) = xo; because x(to) = (x (to); x (to)) 
this is equivalent to specifying both x (fg) and x (fo). 

Theorem 25.2 requires that g and Dg are continuous. Since x2 is continuous, we 
need f (x2, x1, t) to be a continuous function of x; and x2; and since 


0 1 
vo ne ies 


we need the partial derivatives of f with respect to x; and x2 to be continuous. So 
we can rewrite Theorem 25.2 in the form of Theorem 11.1 from Chapter 11. 


Theorem 25.3 Given a function f (x2, x1, t), suppose that f, df /dx1, and of /dx2 
are continuous functions for a, < x1 < a2, by < x2 < b2 and t) <t < t2. Then 


: x 
! Th the text we use the notation (x1, x2) to denote the column vector (2 ) 
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for all initial conditions 
x(to) = xo and x(to) = yo (25.5) 
with ay < x9 < a2, bt < yo < bo, and ty < to < tp there exists a unique solution 
of 
x = f(x,x,t) (25.6) 


on some interval I containing to, i.e. a continuous function with two continuous 
derivatives that satisfies (25.5) and the equation (25.6) on I. 


A similar trick to this can be applied to deal with higher order equations, see 
Exercise 25.1. 


Exercises 


25.1 By choosing an appropriate collection of new variables x;,..., x, rewrite the nth 
order differential equation 


d’x ds dx 
=f\apee oe a et 
dt” dt"— dt 
as a set of n coupled linear first order equations. Find the conditions on the function 


Ff (Xn, ---, x1, t) for the original differential equation to have a unique solution. 
25.2 Suppose that f is a Lipschitz function of x, i.e. that for some L > 0 


f(x) — f(y)| < LIx — yl. 


Use an argument similar to that of Exercise 6.3 to show that if x(t) and y(t) are two 
solutions of 


dx/dt=f(x) with ~—-x(0) = xo (E25.1) 


and z(t) = x(t) — y(t) then 

d 

—_|2\* < 2L|2/°, 
dt 


and hence that the solution of (E25.1) is unique. (You might find the Cauchy—Schwarz 
inequality |a - b| < |a||b| useful.) 


26 


Explicit solutions of coupled linear systems 


In the following chapters we will consider what happens when we have two de- 
pendent variables, x(t) and y(t). In a general pair of coupled first order equations 
the derivative of x can depend not only on x and f, but also on y, and vice versa, 


ees 
y= g(x, y,?). 


The solutions x(t) and y(t) are, at least in principle, inextricably entangled with 
each other. Such systems are often referred to as ‘two-dimensional’, since the state 
of the system can be completely specified by the two variables x and y. The best 
way to understand the solutions of such coupled equations is graphically, using a 
two-dimensional version of the phase diagrams we drew in Chapter 7. The remain- 
der of the book concentrates on this graphical approach. 
However, in this chapter we first discuss a method for finding explicit solutions 
for a pair of coupled linear equations, 
fea aettad 2 (26.1) 
y=cx+dy+ g(t). 


We saw in the previous chapter that by introducing extra variables we can rewrite 
a second order equation as two coupled first order equations. Given two coupled 
linear equations we can reverse this process. This method is the most reliable if 
you want to find an explicit solution of the equation, and in particular is good for 
dealing with the inhomogeneous problem (when f(t) or g(f) is non-zero). 

We will assume here that b 4 0. If b = c = O then the equations are not coupled 
and we could solve for x and y separately using the integrating factor method, 
while if b = 0 but c 4 0 we can follow the method below swapping the roles of x 
and y. 
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Since b ¥ 0 we can rearrange the first equation of the pair in (26.1) to give y in 
terms of x and x, 


c—ax —fit 
pe (26.2) 
b 
Differentiating this will give us y in terms of x and x, 
oe f(t) 
= ; , 
and we can now substitute these into the second equation from (26.1) to give 
¥—ax— fit x —ax— f(t 
pe) =cx+d choses SU) + g(t). 
b b 
Rearranging this gives a second order equation for x, 
¥ —(a+d)x + (ad — be)x = f(t) + bg(t) — df(t). (26.3) 


We should be able to solve this to find x(t), using the techniques for second order 
linear equations we have already covered. Once we know x(t) we can then use the 
formula in (26.2) to work out y(f). 

Notice that it is not necessary to learn any new techniques to solve (26.1), apart 
from the ‘trick’ of substitution that is used here to turn the pair of equations into 
one second order equation. 


Example 26.1 By deriving the second order differential equation solved by x, find 
the general solution of the coupled equations 


K=xty 
y =4x —2y +4e7-7, 


Find also the solution that satisfies the initial conditions x (0) = 0 and y(0) = —1. 
Rearranging the first equation we can find y in terms of x, 

y=Hx-xX, (26.4) 
and so, differentiating this, 

YS xX K, 
Substituting these into the second equation gives 

#- k= 4x -2G@—x) + 4e~7! 
y y 

which simplifies to 


¥+x—6x =4e77", (26.5) 
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To solve this we first find the complementary function, i.e. the solution of 
Z4+z-62=0. If we try z(t) =e the resulting auxiliary equation is 
k* +k — 6 =0 with roots k = 2 and k = —3, yielding the complementary func- 
tion 


z(t) = Ae~ + Be7*". 


Since e~2 


ce~~! for a particular integral of (26.5). To obtain 4e 
need 


‘ is not a solution of the homogeneous equation we can try xp(t) = 
~2! on the right-hand side we 


4ce~*" — 2ce~# — 6ce~! = 4e~**, 

=) 

so c = —1, and xp(t) = —e"7 
x(t) is therefore 


is a particular integral. The general solution for 


x) = Aet Be Se, 
Having found x(t) we can now find y(¢) using (26.4), 
yit= Ac 3 Be 49s = [Ae 4 Be Se 
—_—_ COS T-”C CCC CON” 
x x 
= Ae” — 4Be~* + 3077", 
The general solution is therefore 


x(t) = Ae* + Be7*! — e~7! 

y(t) = Ae — 4Be7*! + 3e77!, 
Note that the constants in the solutions for x(t) and y(t) are the same A and B, 
and the solutions are tied together by the choice of these constants. 


The solution that satisfies the initial conditions x(0) = 0 and y(O) = —1 must 
have 


A+B-1=0 and A-4B+3=-1, 


i.e. A =Oand B = 1; itis 


x(@t)=e te 7% y(t) = 3e77 — de. 


Example 26.2 By finding the second order differential equation solved by x, find 
the general solution of the coupled equations 


x =2x+5y 
y= 2x. 


Find also the solution satisfying the initial conditions x(0) = 5 and y(O) = —4. 
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Rearranging the first equation we obtain y = £(x — 2x). Therefore 
oe (= 2%) , 
= =(* — 2x) = —2x, 
5 
and so 
x — 2x + 10x =0. 
To solve this we try x = e“ and obtain the auxiliary equation for k, 
k* —2k +10 =0. 


The solutions of this equation are 


and so 
x(t) =e'(Acos3t + B sin3r). 


Since y = 5 (x — 2x) the solution for y(t) is given by 


— 


y(t) = zl ‘T(3.B + A) cos 3t + (B — 3A) sin 3t] — 2e'[A cos 3t + B sin 3t}} 


t 
= (GB — A) cos3t — 3A + B) sin 3¢]. 


Since A and B are arbitrary constants, if we set A = 5C and B = 5D the general 
solution is 


x(t) = e'(5C cos 3t +. 5D sin 3r) 


y(t) = e'[(3D — C) cos 3t — (3C + D) sin 34]. eee) 
To ensure that x(0) = 5 and y(0) = —4 we need 
5C =5 and 3D-C= -4, 
so that C = 1 and D = —1. Thus the solution satisfying these initial conditions is 


x(t) =e! (5cos 3f — 5 sin3t) 
y(t) = —e’ (4 cos 3t + 2sin3r). 


The graphs of x(t) and y(t) against ¢ are shown in Figure 26.1. 


Fig. 26.1. 
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Graphs of x(t) [solid line] and y(t) [dotted line] against t. Both are 


oscillations whose amplitude increases exponentially. 


Exercises 


26.1 Find the general solutions of the following differential equations by converting them 
into a single second-order equation. Also find the solution that satisfies the given 


initial conditions. 


(i) 


x=4x-y 


x(0) =0 and y(0) = 1; 


ysxtyte, 


(ii) 


x =x —4y+cos2t 


x(0) = 1 and y(0) = 1; 


y=x+y, 


(iii) 


x =2x+2y 


x(0) = O and x(0) = 1; 


y=6x+3y+e’" 


(av) 


x = 5x —4y+e* 


x(0) = 1 and y(0) = —1; 


y=x+y, 


(v) 


x =2x+5y 


x(0) =2 and y(0) = —1; 


y = —2x + cos 3t, 
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(vi) 

x =x-+ y + e? 

y = 4x —2y+e7 y(0O) = —1 and x(0) = 1; and 
(vii) 

x = 8x + l4y 


= =1. 
y=Tx+y, x(0) = y(0) 
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The matrix approach to linear equations: 
eigenvalues and eigenvectors 


We now reconsider the coupled homogeneous linear system 


eens (27.1) 


y=cxt+dy. 
Our new approach may initially seem a little complicated, and it is not the best 
method if we want to find an explicit solution. However, it will enable us to draw 
the phase portrait (phase diagram) for this linear system after some fairly simple 
calculations. We will then use these linear phase portraits to analyse coupled non- 
linear systems. This graphical approach forms the main subject matter of all that 
follows. 


27.1 Rewriting the equation in matrix form 


The starting point is the observation that there is a much more compact way to 
write (27.1) using vector and matrix notation. If we write 


_ (x) 
x= (39) 
a b 
a= (02) 


(see Appendix B for some background material on matrices) then we can rewrite 
(27.1) as 


and define a matrix A by 


x = Ax. (27.2) 


Notice that since this equation is linear, we have a superposition principle; linear 
combinations of solutions will still satisfy the equation, i.e. if x; (¢) and x2(t) solve 
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(27.2) then so does 
x(t) = ax, (t) + Bxo(t), (27.3) 


since 
dx d ; ; 
ar gy om) + Bx2(t)] = wx) + Bx2 
= aAx; + BAX = A[ax; + Bx2] 
= Ax. 


Now, observe that the way equation (27.2) is written makes it look like the 
simple linear equation x = ax, whose solution we know is x(t) = Ce“. We will 
try to find a solution of (27.2) by guessing that it has the same type of exponential 
dependence on time (e*’ for some A); however, since x has two components the 
‘coefficient’ (C in the solution x(t) = Ce“) will have to be a constant vector v. 
Our trial solution is therefore 


x(t) = ev. (27.4) 
If we substitute this guess into (27.2) then we obtain 
d 
—[ey] = Afe*’v]. 
dt 
Since v is a constant vector the d/dt on the left-hand side only affects the expo- 
nential term: 
rey =e” Av. 
Dividing by the non-zero factor e*’ this becomes 
Av = Dv. (27.5) 


This is known as an eigenvalue equation. Any values of 4 for which this equa- 
tion has a non-zero solution for v are called the eigenvalues of A, and the corre- 
sponding v are called the eigenvectors of A. The remainder of this chapter dis- 
cusses how to find the eigenvalues and eigenvectors of A, and we return to their 
relevance to the differential equation dx/dt = Ax in the next chapter. 


27.2 Eigenvalues and eigenvectors 


For what values of A does the eigenvalue equation 


Av=Av (27.6) 
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have a non-zero solution for v? If we rewrite the equation as 
(A —ADv=0, (27.7) 


where I is the 2 x 2 identity matrix, 


1 0 
ty 3) 


and 0 = (0, 0), itis clear that if A — Al is invertible then the only solution is v = 0. 
So to have a non-trivial solution (v 4 0) we have to ensure that A — Al is singular, 
i.e. that 


det(A — AD = |A — All| = 0. 

Rewriting this in full it becomes 
a b\ (fa O*V_ 
ed O As| 


which gives a quadratic equation for A (‘the characteristic equation’) 


a-x 


b 
: 1. [=@- NE =) ~be=0 


7 —(a+d)r + (ad — bc) = 0. (27.8) 


The solutions of this equation are the eigenvalues of the matrix A, and for these 
eigenvalues we can expect to be able to find non-zero vectors v that have Av = Av; 
these are the corresponding eigenvectors. Since the eigenvalues are the solutions 
of a quadratic equation, they could be real and distinct, there could be only one, 
or they could be a complex conjugate pair. These different possibilities will have 
different implications for the solutions of the original differential equation, and we 
examine these three possibilities in turn in the following three chapters. 

Given an eigenvalue 4 the simplest way to find the associated eigenvector 
Vv = (v1, v2) is usually to solve the rearranged version (27.7) of the eigenvalue 


equation, 
(A — all) () — 0. 
v2 


Note that if v is an eigenvector with 
Av = Av 
then any constant multiple of v is also an eigenvector, since 


A(cv) = cAv = caAv = X(cv). 
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A canonical (standardised) choice of eigenvector is the vector with unit length 
(there are in fact two of these in the direction of v), but often it is easier to work 
with an eigenvector of integer values (if there is one). 

This is all most easily illustrated by considering an example. 


Example 27.1 Find the eigenvalues and eigenvectors of the matrix 
(23) 
To find the eigenvalues of A we solve the equation det(A — AI) = 0; this is 
is a)-G alls s2al-° 
6 3 O 1X 6 3-2 . 
which gives the quadratic equation 
(2—A)3—-—A)—-12=0. 
Multiplying this out and simplifying we obtain 
w7M-—51-6=0 or (A—6)(A+1)=0. 


The solutions of this, A = 6 and A = —1, are the eigenvalues of A. 
Now that we know the eigenvalues we can find the corresponding eigenvectors. 


If we write 
( . ) 
v= 
U2 


then when 2. = 6 we need (A — 6])v = 0, ie. 
PBN if CON | 2 Pot) 2 1) = % 
6 3 0 6 ~\ 6 3 vw} 
This matrix equation gives us two equations relating v; and v2: 


—4v,; +2v2 =0 and 6v, — 3u2 = O. 


At this stage we will always obtain two equations, one of which is a multiple of the 
other. We need only consider one equation, then, and this tells us that v2 = 2v,; 
the general eigenvector corresponding to Aj = 6 is 


(21) =" (2): 
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We usually drop the arbitrary constant, and talk about (for example) ‘the eigenvec- 


tor’ 
1 
; 2 > 


remembering that in fact any multiple of (1, 2) is an eigenvector.! 
To check that v, really is an eigenvector, simply multiply it by A: 


wna (6 3) (2)=(si6)=(2)= 


So vj is indeed an eigenvector corresponding to the eigenvalue 6. 
For A = —1 we want the eigenvector v2 = (v1, v2) to solve [A — (—I)]v = 0, 


22M of Say” QD — 3° °2 v1\_9 
6 3 0 -l ~\6 4 vw) 
Again, there are two equations, with one a multiple of the other, 


3v, +22 =0 and 6v, +42 = 0; 


so v, and v2 are related by v2 = —3v,/2: the general eigenvector corresponding 
to the eigenvalue 4 = —1 is 


v4 - 1 
(a a 0) 


In this case we could choose 
ages 2 
ae eee 


as our representative eigenvector. Checking once more that this really is an eigen- 
vector, we have 


oe 2 2 2\ (4-6)\_(-2 oa 
FONG BIKES NID O PL Ng pe 
as required. 


We have now found the two eigenvalues and their corresponding eigenvectors, 


4, =6 with n=() and 42 =-—1 with wal). 


! Th some respects it is therefore more sensible to speak of an ‘eigendirection’, in that any vector v in the same 
direction as the vector (1, 2) satisfies Av = 6v. 
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Since we will be calculating many eigenvalues and eigenvectors in what follows, 
any work that we can save ourselves later on will be useful. To this end, we note 
here that there are some special cases in which it is possible to ‘read off’ the 
eigenvalues and eigenvectors from the matrix itself. 


Example 27.2 The eigenvalues of a diagonal matrix 

Ay O 

0 do 
are 41 and 2, with corresponding eigenvectors Vv, = (1, 0) and v2 = (0, 1), re- 
spectively. 


This is easy to check, either directly by matrix multiplication, or by applying 
the general method. 


Slightly less obvious, and therefore more useful, is the following. 


Example 27.3 The eigenvalues of both the matrices 


Ay bd ae A, O 
0 1) c A2 


are d1 and dp. For the first, the eigenvector corresponding to , is (1, 0); while for 
the second, the eigenvector corresponding to X2 is (0, 1). In each case the other 
eigenvector needs to be found using the standard method. 


The fact that A; and Az are the eigenvalues becomes obvious when you write 
down the characteristic equation; for example, if A is the matrix on the right then 
the eigenvalues A are the solutions of 


Ay —AX 0 


|A — All| = Pe 


= (A1 — A)(A2 — A) = 9, 


and hence A = A, or A = Az. That the eigenvector corresponding to Az is (0, 1) 
essentially follows “by inspection’, 


(2 JC) =(2,)=#()) 


The other eigenvector will need to be found via the standard calculation (find v 
with (A — A; Dv = 0). 


All the above examples have distinct real eigenvalues. We now look at an exam- 
ple in which there is only one eigenvalue. 
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Example 27.4 Find the eigenvalues and eigenvectors of the matrix 


5 4 
a=(} re (27.9) 


The eigenvalues A are the solutions of 


Bia “a4 
t,* Mea 


[=12-e249= 0-37 =0, 


thus A = 3 is a repeated eigenvalue. Since we have only one eigenvalue we can 
only expect to find one eigenvector v; if (A — AIl)v = 0 then 


and so v = (2, 1). 


It is also possible for the eigenvalues to be complex. Since they are the solutions 
of a quadratic equation, they will be a complex conjugate pair. The eigenvectors 
will now be complex too, and also come as a complex conjugate pair (so it will 
only be necessary to calculate one of them). 


Example 27.5 Find the eigenvalues and eigenvectors of the matrix 
ne) 
sees), 
The eigenvalues of A are given by the solutions of the quadratic equation 


2-A 5 
—2 -i2 


[= 2-9-2) 10-77-24 10=0 


The roots of this equation are the complex conjugate pair 


_ 2474-40 
7 e 


To find the eigenvector associated with | + 3i we have to solve (A — ADv = 0, 


156; 
1 — 31 5 v\ 
Grae ie 


(h:= 31) hy + 502. \) 0 
—2v, —(14 31). ] 


r =1+3i. 


or 
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(Although not entirely clear, the second equation is, as usual, a multiple of the 
first.) Using the first equation we must have 5v2 = (3i — 1)v,, and so we can take 


(x21) 


as a representative eigenvector with eigenvalue 1 + 31. Since 


2 5 5 \_ (10+15i-5\_ (54 15i 
230; ory © —10 = 5 
af 08 
=a+30(,,°,). 


(5, 3i— 1) is indeed an eigenvector corresponding to the eigenvalue 1 + 33. 
The eigenvector corresponding to 1 — 3i, the complex conjugate of 1 + 31, will 
be the complex conjugate of the eigenvector we have already found, 


(s'-1) 


To check that taking the complex conjugate works, we have 


2 5 §. \.. fi SisiS5\. (315i 
oe Oy Vat > —10 ~ {5 
. 5 
=a-3(_° 4), 


27.3 *Eigenvalues and eigenvectors with MATLAB 


as we should. 


It is easy to find eigenvalues and eigenvectors using MATLAB. If A is a square matrix 
then the command eig (A) will return the eigenvalues of A. 


>> A=[2 2; 6 3] 


A= 2 2 
6 3 

>> eig(A) 

ans = -l 


If you also want Matas to find the eigenvectors for you then you can obtain 
them by typing [V D]=eig(A). This will return two matrices, v and D, where the 
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two columns of V are the eigenvectors, chosen so that they have length one, and D 
is a diagonal matrix consisting of the eigenvalues. 


>> [V D]=eig (A) 


V = -0.5547 -0.4472 
0.8321 -0.8944 


Note that, as you would expect, the first column of V is the eigenvector corre- 
sponding to the eigenvalue in the first position in the diagonal matrix D. However, 
since the eigenvectors have been chosen so that they have unit length they can ap- 
pear to be quite complicated (in our analytical calculations we found eigenvectors 
(2, —3) and (1, 2)). You can also use MATLAB to find the ratio of the components 
of the eigenvectors to each other if you want to write the eigenvectors as a pair 
of integers. First type format rat, which tells MATLAB to display its answers as 
fractions in their lowest terms, and then calculate the ratio of the components of 
the eigenvectors, 


>> format rat 
>> V(2,1)/V(1,1) 


ans = -3/2 
>> V(2,2)/V(1,2) 


ans = 2 


The first calculation shows that (1, —3/2) is a possible choice for the first eigen- 
vector (which gives our (2, —3) after multiplying by two), while the second gives 
(1, 2) immediately. 


Exercises 


27.1 Find the eigenvectors and eigenvalues of the following matrices: 


(i) 
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(ii) 

a?) 

G4 
(iii) 

>) 

26°. 1 
(iv) 

9 2 

OS by? 
(v) 

7 1 

-4 ll 
(vi) 

on £2 

a co 
(vii) 

6 0 

G aay? 
(viii) 

Ae 2D 

i? 


| ( 
( 


(x) 
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Distinct real eigenvalues 


In this chapter, and the following two, our main aim is to show how to use a knowl- 
edge of the eigenvalues and eigenvectors of A in order to draw the phase diagram 
for the equation x = Ax. As in Chapter 7, this phase diagram will illustrate the 
qualitative behaviour of the solutions by showing a representative choice of the 
curves traced out by the solutions (x(t), y(t)), labelled with an arrow to indicate 
in which direction the solution moves as f increases. 

In each chapter we will examine one of the three possibilities (two distinct real 
eigenvalues, a complex conjugate pair of eigenvalues, or a repeated eigenvalue) 
and for each case we will show 


(i) how an appropriate change of coordinates, based on the eigenvectors of the 
matrix A, can be used to transform the differential equation into a standard, 
simpler (canonical) form; 

(ii) how to find the explicit solution of this simple form of the equation; 
(iii) how to draw the phase portrait for the simple equation; 


and hence 


(iv) how to find the explicit solution of the original equation; and 
(v) how to draw its phase portrait. 


Since we already have a reliable method for solving coupled linear equations, 
(ii) and (iv) will be much less important than (iii) and (v). Besides giving the math- 
ematical justification of step (v), the coordinate transformations required to sim- 
plify the equation also provide a very natural illustration of the notion of the Jordan 
Canonical Form of a matrix, an important topic in the theory of linear algebra. 

In the first of these three chapters we assume that A has two distinct real eigen- 
values 4; and Az, with corresponding eigenvectors v; and v2. 
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28.1 The explicit solution 


In this, the simplest case, we do not need to make a change of coordinates in order 
to find the form of the explicit solution given the eigenvalues and eigenvectors. 

In the previous chapter we found that when we tried x(t) = ev as a solution of 
the linear equation x = Ax (see (27.4)), this gave rise to 


Av = Av. 


Given the eigenvalues and eigenvectors of A this means that we have obtained two 
possible solutions of the differential equation, 


ety, and 25, 


We saw in equation (27.3) in the previous chapter that a linear combination of two 
solutions is still a solution; thus the general solution of dx/dt = Ax can be written 
as 


x(t) = Ae*!y,; + Be?" vo. (28.1) 


We now apply this method to a linear equation that is the homogeneous version 
of the equations in Example 26.1. 


Example 28.1 By finding the eigenvalues and eigenvectors of an appropriate ma- 
trix, find the general solution of the coupled system 


x=x+y 


y =4x -2y. veo) 


Rewritten as a matrix equation the problem becomes 


a ()=(4 -2)(): 


We will denote the matrix on the right-hand side by A. 
If we try a solution for the equation x = Ax of the form x(t) = e”“v, then as we 
saw above this leads to this eigenvalue problem 


Av = Av. 
To find the eigenvalues of A we solve the equation det(A — AI) = 0; this is 


es 1 


4 4/26 


which gives the quadratic equation 


C= e243) 40; 
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Multiplying this out and simplifying we obtain 
74+24-6=0. 


The solutions of this, 1 = 2 or A = —3, are the eigenvalues of A. 
Now that we know the eigenvalues we can find the corresponding eigenvectors. 


If we write 
() 
v= 
v2 
then when 1. = 2 we need (A — 2I/)v = 0, ie. 
ed hfe ON ee fol PV ag 
4 -2 0 2 —\ 4 -4 ve)” 
We obtain two equations relating vj and v2, 
Vy — v= 0 and Av, = 4u2 = 0, 


and so vy = v2. A representative eigenvector is therefore 


me) 


For A = —3 we want the eigenvector v2 = (v1, v2) to solve (A + 3Dv = 0, 


| ee OR J _{4 1 v1) _9 
4 -2 OB NE ae 
This provides just one equation relating vj and v2, 


4v,; + v2 = 0, 


ge 1 
coe ees" 
is an eigenvector. 


The general solution is therefore (cf. (28.1)) 


SO v2 = —4v1, and 


gor aft 
x(t) = Ae (1) +8e (2) 


28.2 Changing coordinates 


When we have two distinct real eigenvalues we will change to a coordinate sys- 
tem that uses the eigenvectors as axes. It is a general result that eigenvectors 
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Fig. 28.1. Writing x as a combination of v, and v2; here x = 2v, + Vo. 


corresponding to different eigenvalues are linearly independent (see Appendix B), 
which means that any vector can be written as an appropriate linear combination 
of v, and vo, 


x= (*) — iv) + iv, (28.3) 


see Figure 28.1. 
Writing (28.3) in full gives 


(*) ps ce ee 
y Xvi. + Yur 


(where v;; is the jth component of v;). We could also write this as 
Beer’ 
y v2 va) \y)’ 


x = [Vv] Vo] x, (28.4) 


or more conveniently 


where x = (x, y) and we are using the notation [v; v2] to mean the 2 x 2 matrix 
with columns v, and v2. To further simplify notation, we will write P = [v, v2], 
so (28.4) is just x = Px. To find x given x, we have to multiply both sides by the 
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inverse of P, 
k=P'x=[v vo] "'x. (28.5) 
In order to write the differential equation 
— = Ax (28.6) 


in these new coordinates, we need to calculate dx/dt in terms of x. Using (28.5), 
the original equation (28.6), and (28.4), we have 


dx == 4 ply 
dt dt 
=<" , dx 
dt 
= P-'Ax 
= P'APs, 


where we have used x = Px in the last line. We have obtained a new linear equation 
for x, 
dx 
— =P 'APX. 28.7 
rp x (28.7) 
Although this looks more complicated, if we now substitute for P we have 


PAP = [v, vo] [A[vy, v2] 
= [v1 vol [Av Avz] 


= [v1 vol! [Aivy Aaval. 


That A[v; v2] = [Av; Av2] follows from the definition of matrix multiplication 
(see Appendix B), and then since Av; = A;v; we have [Av; Av2] = [A;vj A2v2]. 
We can rewrite 


Ayuyy A2v21 Vit U21 Ay O 
[Ajvy Agv2] = ( ) = ( ) ( 
Ajuj2 A2v22 VI2_—-V22 O dA2 


: 0 2 : 


—1 = rer Ar O\ (Ar O 
PAP=[vy vol” [vy val (7 Vf NOR oc” 
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Therefore the equation for x is 


dx Ay O)\. 
ooh = 28. 
a(0 i)* 288) 
which in terms of the coordinates x and y is 
dx /dt = A1x 
28. 
dy/dt = A2y. Ge) 


We have obtained equations for x and y that are no longer coupled. We can solve 
these equations easily to find the solutions 


X(t) = Ae*!" and ~—s H(t) = Be’. 


Note that if we use the expression x = Xv; + yv2 for x in terms of x and y (this 
was equation (28.3)) then we recover the general solution of the original problem 
as given in (28.1), 
x(t) = Ae*!y,; + Be*2! vo. 
We now apply these ideas to the differential equation we considered as Exam- 
ple 28.1. Note that this is not a sensible way to find the explicit solution of the 


coupled equations in the example, since we can write down this solution as soon 
as we know the eigenvalues and eigenvectors of A. 


Example 28.2 By means of an appropriate coordinate transformation decouple 
the equations 

K=xt+y 

y = 4x -2y 
and hence write down their general solution. 


Rewriting the equation in matrix form we have 


cet ea 
~\4 -2)° 


— 
A 


We found the eigenvalues and eigenvectors of the matrix A for Example 28.1: they 
are 


Ay =2 with n=({) and A2 = -—3 with n= ( ae 


Note that vj and v2 are linearly independent; if 


(Jer) 


28.2 Changing coordinates 275 


then a + 6 = 0 and a — 48 = 0. The only solution of these two equations is a = 
B = 0, and so the two vectors are linearly independent as claimed. 
In order to write a vector x in terms of these two eigenvectors, 


@ — tv) + vo, (28.10) 


we need 


(‘x = Px’). 
Multiplying the extreme left- and right-hand sides of this equation by P~! will 


give x in terms of x, 
#\f1 1\° 
re Sa he 


Computing the inverse we arrive at 


Le. 
ee 7, ~ dl 
x= —(4x+y) and y=s(x-y). 
5 5 
Referred to these new coordinate axes the equation becomes 


dx 


— =P 'APx 
dt 


ale nile Nile 


i at 
eS PP eS Lf 
ae es ee 
— — 
ee 
fn GO 
Nw Be 
eee ee eae 
Nw wv 
Sa eS 
amie e 
— 
i 

aN 
Na” 

war 


So in our new variables we obtain the decoupled equations 


a 


aces ares 
co oa y 
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The solutions of these can easily be seen to be x(t) = Ae” and y(t) = Be~*', and 
so the solution of the original equation can be recovered from (28.10), 


a 2t 1 —3 1 
x(t) = Ae ({) +8e eer 


28.3 Phase diagrams for uncoupled equations 


We have just seen that in a new coordinate system x = (%, ¥) that uses the eigen- 
vectors as axes the original equation becomes 


dk (r; 0). 
a= (3 oe (28.11) 


This gives the two decoupled equations dx/dt = 4,x and dy/dt = A2¥, whose 


solutions are 
X(t) =Xpe*" = and ~—f\(t) = Fe??!. (28.12) 


We now draw the phase diagram for (28.11), assuming that A) and A2 are both 
non-zero (for the exceptional case when one of them is zero see Exercise 28.6). 
We can use our explicit form for x(t) and y(t) in order to find the equation of the 
curves traced out by these solutions as ¢ increases.! Since 


@eyn _ ght 

x0 S 

(x2) - (oy. 
jo) \ Xo 


So the trajectories will trace out the curves 


we have 


j= Kx 2/1, 


where the constant K depends on the initial conditions (in fact we have K = 
yoXy r2/ ay Note that it is also easy to see from (28.12) that given an initial con- 
dition on one of the axes (x(0) = 0 or y(O) = 0) then the solution remains on the 
axis; we Say that the axes are invariant. 

If both eigenvalues are negative then x and y decrease to zero as ¢ increases. 


Suppose that A; < Az < 0; then Aj/Az > 1, and a typical phase portrait is shown 


! Tris also possible to derive this equation from (28.9), without finding the solutions first, by solving the equation 
dy/dx = A2¥/A1X, see Exercise 28.4. 
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Fig. 28.2. A stable node. 


Fig. 28.3. Plots of x(t) [solid line] and y(t) [dotted line] against t. Both x and y 
decay to zero, but note that x decays to zero much faster than y. (In the pictures 
dx /dt = —3x and dy/dt = —y.) 


in Figure 28.2 (like the graph of x = Ky’ withr > 1; a ‘rotated parabola’ when 
r = 2). In this case the origin is called a stable node. Note that the axes are invari- 
ant as remarked above. 

The axis associated with the faster rate of decay has been marked with two 
arrows on the figure; the solutions approach the origin, tangential to the direction 
corresponding to the value of 4 with smaller modulus. You can see the reason 
for this by imagining the x component of the solution decaying very fast, so that 
the trajectory gets close to the y-axis quickly, and then tends towards the origin 
‘almost vertically’. You can see this in Figure 28.3, which shows the graphs of 
x(t) and y(t) against t for some sample initial conditions. 

If both eigenvalues are positive then both x and y tend to too depending on 
the signs of the initial conditions. Suppose that A2 > A, > 0; then A2/A1 > 1 and 
solutions move on curves like those shown in Figure 28.4: the graph is similar to 
y = Kx’ withr > 1 (e.g. a parabola when r = 2). 
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Fig. 28.4. An unstable node. 


Fig. 28.5. A saddle point. 


The direction in which growth is faster has been marked with a double arrow; 
note that solutions move away from the origin tangent to the direction that cor- 
responds to slower growth (the smaller value of 4). In this situation the origin is 
known as an unstable node. 

If 4; and Az are of opposite signs then one of x and y increases, while the 
other decreases; there is one ‘stable direction’ and one ‘unstable direction’. For 
example, if Az <0 < A, then x(t) tends to too while y(t) tends to zero. The 
phase diagram is shown in Figure 28.5; the curves have equations like xy” = K 
(e.g. when r = | they are hyperbolae). The origin is called a saddle point, or 
just a saddle. (The behaviour here is like a ball rolling on a horse’s saddle, see 
Figure C.1, for example.) 


28.4 Phase diagrams for coupled equations 


Ah 


NO 


4 


Fig. 28.6. In the left-hand picture (x coordinates) the eigenvectors appear as 
orthogonal axes; in the right-hand picture the axes have ‘twisted’ back to their 
true positions, moving all the other trajectories too. 


28.4 Phase diagrams for coupled equations 


Once we have drawn the phase diagram for the (x, ») variables it is fairly straight- 
forward to draw the phase diagram for the original (x, y) variables. 

The transformation we used to change coordinates had the effect of making the 
eigenvectors of the matrix A the coordinates axes for the new variables x. This 
means that the x and jy axes, at right-angles in the phase diagrams we drew in Fig- 
ures 28.2, 28.4 and 28.5, actually correspond to the eigenvectors in the (x, y) plane. 

When we transform a picture in the (x, y) plane back to the (x, y) plane the 
axes will ‘twist’ and line up with the eigenvectors. To draw the phase diagram in 
the (x, y) plane we also have to ‘twist’ all the other trajectories accordingly; this 
is illustrated in Figure 28.6 for the case of a saddle point. 

In order to draw the phase diagram for a particular example it is not necessary 
to make the transformation to the new coordinate system, draw the phase portrait 
there, and then transform it back. Instead, once you have found the eigenvalues and 
eigenvectors first draw the eigenvectors (in the (x, y) plane), label the eigenvectors 
with an arrow in the appropriate direction (away from the origin if the eigenvalue 
is positive and so corresponds to an unstable direction, towards the origin if the 
eigenvalue is negative and thus corresponds to a stable direction), and then ‘fill in’ 
between the two eigenvectors in a consistent way. This idea is illustrated in the 
following example. 


Example 28.3 By finding the eigenvectors and eigenvalues of an appropriate ma- 
trix draw the phase portrait for the equation 


x=x+y 
y =4x -2y. 
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This is the same example we looked at above; we have already found the eigen- 


ae: ) , which are 


values and eigenvectors of A = ( fe 


A; =2 with n=({) and do =-—3 with a=(,). 


To draw the phase portrait, first we draw lines in the direction of the eigenvectors, 
and add arrows according to the sign of the eigenvalue, as in Figure 28.7. It is then 


rh L L Ll Ll Ll L 
4 3 2 1 0 1 2 3 4 


Fig. 28.7. First draw the eigenvectors and add arrows; away from the origin if 
A > 0, towards the origin if A < 0. 


Fig. 28.8. The phase portrait for the differential equation of Example 28.3. 
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2S 


Fig. 28.9. Stable manifolds for a stable node (the whole plane), a saddle point 
(the eigenvector corresponding to the negative eigenvalue) and an unstable node 
(just the origin). 


simple to add more illustrative trajectories to give the complete phase diagram of 
Figure 28.8. 


28.5 Stable and unstable manifolds 


We now introduce some terminology that will prove extremely useful when we 
come to study nonlinear systems. At present it may seem overly complicated, but 
it is easier to understand in these simple linear examples. 

The stable manifold? of the origin, written W‘(0), is all those points lying on 
trajectories that approach the origin as t — oo; for a stable node this stable man- 
ifold is all of R?; for a saddle this stable manifold is just the eigenvector corre- 
sponding to the negative eigenvalue; and for an unstable node the stable manifold 
is just the origin. These alternatives are illustrated in Figure 28.9. 

Note that when the origin is a saddle point its stable manifold separates the plane 
into two parts; in our picture all the points to the right of the stable manifold move 
away ‘to the right’ (so to x = +00), while all the points to its left move away 
‘to the left’ (to x = —oo). In this case the stable manifold plays a particularly 
significantly rdle and is often called the ‘separatrix’. 

The unstable manifold of the origin, written as W"(0), is not quite ‘all those 
points that move away from the origin’; it consists of all those points lying on 
trajectories that would approach the origin if we reversed the sense of time, i.e. re- 
versed their arrows. When the origin is a stable node its unstable manifold is just 
{0}; for a saddle point the eigenvector corresponding to the positive eigenvalue is 
the unstable manifold; and for an unstable node the unstable manifold is the whole 
of R?, see Figure 28.10. 


2A manifold is an abstract mathematical object; the easiest way of imagining it is as a generalisation of a 
‘surface’. In the relatively simple systems we are looking at here this generality will not be needed, although 
we will see some of the possible complexity in the final chapter when we consider the Lorenz equations. It is 
probably more helpful for now to think of WS(0) as the ‘stable set’ of the origin. 
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Fig. 28.10. Unstable manifolds for a stable node (just the origin), a saddle point 
(the eigenvector corresponding to the positive eigenvalue) and an unstable node 
(the whole plane). 


Exercises 


28.1 Write down the general solution and draw the phase portrait for the equation x = Ax, 
when the eigenvalues and eigenvectors of A are as follows. You should take particular 
care with stable (or unstable) nodes to ensure that the trajectories approach (or move 
away from) the origin tangent to the correct eigenvector. 

G) Ay =1,v, = C1, 1) anddAg = 2, v2 = (1, —1); 
Gi) Ay = 1, v1 = C1, 0) and Az = —2,w2 = C1, 1); 
(iii) Ay = —2, vy = (1, 2) and Az = —3, v2 = (2, —3); 
(iv) Ay = 3, v1 = (2, 3) andaA2 = —5, v2 = (0, 1); 

(v) Ay =3,v, = (1, 2) andAg = 1,v2 = (1, —3); 
(v1) Ay = 2, Vv, = (0, 1) andAz = —3, v2 = (1, 5); 
(vil) Ay = 1, vy = C1, 1) and Az = 2, v2 = (2, 1); and 

(viii) Ay = —3, v, = C1, 3) and Az = —1, vo = (—3, 2). 

28.2 For the following equations find the eigenvalues and eigenvectors of the matrix on the 
right-hand side, and hence find the coordinate transformation that will decouple the 
equations. Show that this transformation has the desired effect. (You can also write 
down the general solution and draw the phase portrait for the equation if you wish.) 


(i) 
dx | 8 14 
ap a a 
(ii) 
dx (2 0 
a3; °)s 
(iii) 


and 
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(av) 


(C) Given a matrix 


a b 
an (a 


the M-file Lportrait.m will draw the phase portrait for the linear equation x = Ax. 
The program draws the trajectory forwards and backwards from a given initial condi- 
tion, placing an arrow there indicating the direction the solution moves as ¢ increases. 
Draw the phase portraits for the equations in the previous exercise using this program. 
(T) Using the chain rule, if y = y(x(t)) then 


dy dy dx 
dt = dx dr’ 
from which it follows that 
dy dy /dx 
dx dt/ dt 
Therefore if 
ae =x and Y = hoy (E28.1) 
dt dt 
we have 
oy) Ay 
dx Ax 


Solve this to find the equation of the curves traced out by trajectories of (E28.1). 
(T) We have seen in this chapter that if A has distinct real eigenvalues 4; and 42, with 
corresponding eigenvectors vj and v2, then 


where P = [v1 v2]. It follows, conversely, that the matrix with these eigenvalues and 
eigenvector is 


(This is how the M-file makematrix.m constructs matrices with specified eigenval- 
ues and eigenvectors.) Find the matrices whose eigenvalues and eigenvectors are as 
follows: 

Gi) Ay = 3, v1 = (1, 2) and A2 = 6, v2 = (1, —1); 

(ii) Ay = 3, yy = C1, 0) and Ax = —1, v2 = (2, 1); and 
(ii) Ay =5,v, = (1, 1) anddAg = 1,v2 = (1-1, ). 
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(You could now check your phase portraits for Exercise 28.1, using the M-file 
makematrix.m to find the matrix with the specified eigenvalues and eigenvectors, 
and then lportrait.m to draw the phase portraits.) 


28.6 Suppose that A has two eigenvalues, 4; = 0 with eigenvector v; and A2 4 0 with 
eigenvector Vo. 


(i) Write down the general solution of the equation x = Ax. 


(ii) After changing to a coordinate system referred to the eigenvectors the equation 
will become 


dx dy 
rraees dt 
By solving these equations draw the phase portrait in the (x, y) system, and 
hence sketch the phase portrait for the original coordinates. 
(iii) Draw the phase portrait for the equation 


dx | —2 2 
ap Sy 


29 


More phase portraits: complex eigenvalues 


We have seen how to find explicit solutions of the equation x = Ax, and how to 
draw the corresponding phase portrait, when the matrix A has a pair of distinct real 
eigenvalues. In this chapter we treat the first of the two remaining possibilities; a 
complex conjugate pair of eigenvalues A+ = p + iw, with a corresponding pair of 
complex conjugate eigenvectors 74 = vj + iv2. 


29.1 The explicit solution 


If all you want is an explicit solution then it is unlikely that you would want to 
find the eigenvalues and eigenvectors and use the method here, since the method of 
Chapter 26 is much simpler. However, if you have the eigenvalues and eigenvectors 
of the matrix then this method will provide you with the explicit solution after only 
a little work. 

If we try to construct the general solution of x = Ax as we did in the previous 
chapter, by setting x(t) = Ce*+'n, + De*-'n_, we obtain 


x(t) = Ce? HO) Ty) + ive] + C*e "Ty, = ive]. 


Here we have taken the first coefficient to be a complex number C = a + if, and 
the second to be C* (its complex conjugate), which ensures that x(r) is real. 
We therefore have (since z* + z = 2 Re[z]) 


x(t) = 2Re [Cet (v, + iv>)]. (29.1) 
Since C = a + if this is 


x(t) = 2e”’ Re [(a + i8)(cos wt + isin wt)(v; + iv2)] 
= 2e?' Re[((a cos wt — Bsinwt) + i(B cos wt + a sinwt))(v; + iv2)] 


= 2e?"[(a cos wt — B sinwt)v; — (B cos wt + a sin wt)vo). 
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Since @ and # are arbitrary constants we can set A = 2a and B = —2f, so that 
the general solution is finally 


x(t) = e?'[(Acos at + B sinwt)v, + (B cos wt — Asinat)vo]. (29.2) 


We illustrate this method to find, once again, the solution of Example 26.2. It is 
probably better to go through the algebra each time than try to remember (29.2). 


Example 29.1 Find the general solution of the equation 


x =2x+5y 
y= —2x. 


We found the eigenvalues and eigenvectors of the matrix e 5) in Exam- 
ple 27.5, 


p+iw=1+3i witheigenvectors v, +iv2 = (jay +i($) : 


Therefore, using (29.1), 


wna fess[(5) (3) 
t . _ 5 . {0 
= 2e’Re {( + iB)(cos 3t + isin 3t) (>) +1 (5)/| 


= 2e'Re| [a cos 3t — Bsin3r) + i(B cos 3t — asin 31) | 


alep huey) 


= 2e! { («cos 3 — Bsin 3t) (2) — (B cos 3t — a sin3t) (5) 


With C = 2a and D = —2£ we obtain 
x(t) et 5C cos 3t + 5D sin 3t 
yt)) (3D — C)cos3t — (3C + D)sin3t )’ 


which agrees with the solution (26.6) that we obtained earlier. 
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Although the formula in (29.2) does provide us with an explicit solution, it is 
still not obvious how the solution behaves. Drawing the phase portrait will make 
things much clearer. 


29.2 Changing coordinates and the phase portrait 


We now make a coordinate transformation to put the equation into a standard form, 
choosing the real and imaginary parts of the eigenvectors, vj and v2, as our new 
axes. If you want to avoid the details, you should go to equation (29.6) which gives 
the form of the differential equation in these new coordinates. 

Before we make our change of coordinates, it will be useful to note that since 
74 = V1 + iv2 is an eigenvector of A with eigenvalue 44 = p + iw, we have 


A[v; + iv2] = (p +i@)[v) + iv2] = (pv) — wv2) + i(@vj + pv2). 
Taking real and imaginary parts of this equation gives 
Av = pv] — @v2 and Av2 = wv, + pv2. (29.3) 


It is relatively straightforward to check that vj and v2 are linearly independent, 
see Exercise 29.4, so we can write any vector as a linear combination 


X= Xv, + vo. (29.4) 


We saw above in (28.7) that if we make this coordinate transformation then we can 
write 


x =[v) v2]x 


and the differential equation satisfied by x is 


dx fs Z 
— =[v, v2] 'A[v) vol x. 
dt 


In the current case we have 


[vi vol [Alvi v2] =[vi vol '[Av; Avo] 


=[v1 vol '[pvi —@v2 wv + pvr] 


=[v1 v2) ‘Iv val ( : a 


288 29 Complex eigenvalues 


So we have obtained an equation for x which is in the standard, simpler form 


dx Pp @ 
—= x. 29.5 
dt ( -—® / ) . C22) 
Example 29.2 By means of an appropriate choice of coordinates transform the 
equations 
x =2x+5y 
y= —2x 


into the standard form 


di (1 3). 
de> so 


The eigenvectors were found in Example 27.5. They are 


Vv; £iv2 = Cy aG: 


corresponding to the eigenvalues | + 3i. We therefore set 
~{ 5 ~ {9 5 0). 
re) ee) 
— ee 


and so 


“3G (2 9G 3) 
(4 3) 


1/15 45 P 
~ 15 \-45 15 


as required. Note that, since A+ = 1 + 31, this agrees with (29.5). 
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(x,y) 


(0,0) 
0 


Fig. 29.1. Plane polar coordinates; x = rcos@ and y = rsing. 


Since the equation in the new coordinates is 


dx p @)\. 
—= ; 29.6 
dt ( 0). 5p ) : oe) 
the equations for the components x and y are 
a x + wy oe x + py (29.7) 
qe Rey ap Ok + PY: ; 


The easiest way to understand how the solutions of this equation behave is to 
change coordinates yet again, this time into plane polar coordinates. For k = (x, y) 
we set 


xX =rcosé and y=rsine, (29.8) 
see Figure 29.1. Then 
r=k4+y7 and 6 =tan !(¥/%). 


In order to find the equations satisfied by r and 6 we differentiate using the chain 
rule (see Appendix C), 


ra WE4259 = FoVEY 
r 
and 
ee Lee: 
“ES Gye ye Ta? 


Substituting in for x and y from (29.7) we get the simple system 


r=pr 


6 =-a. 
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Fig. 29.2. The phase portrait of a stable spiral point, and the graphs of a sample 
solution with x(t) (solid line) and y(t) (dotted line) plotted against r. Solutions 
tend to zero, oscillating as they go. 


These equations we can solve easily, ! 
r(t)=r@O)e" and = A(t) = (0) — a. 


The trajectories spiral round the origin with angular velocity —w, and when p 4 0 
the distance from the origin either increases exponentially to infinity if p > 0, or 
decreases to zero exponentially if p < 0. 

Note that once again the stability of the origin is determined by the eigenvalues. 
If the real part of the eigenvalues is negative the origin is stable, and called a stable 
spiral. This case is illustrated in Figure 29.2, with the graphs of x(t) and y(t) for 
some sample solutions shown alongside. 

If the real part of the eigenvalues is positive then the origin is an unstable spiral, 
as illustrated in Figure 29.3. 

When the eigenvalues are purely imaginary then p = 0 and so r = 0; the dis- 
tance r from the origin is constant. In this case the orbits are circles centred at 
the origin, as in Figure 29.4. Since the motion repeats itself periodically as it goes 
round and round the same closed curve, the solutions x(t) and y(t) repeated them- 
selves periodically, and such curves are called periodic orbits. In this case the 
origin is referred to as a centre. Note that a centre is stable (if you start close to it 
you stay close to it) but it is not attracting (trajectories do not tend to it). 


! Given the solutions in this form we can write down the general solution of our original equation in a more 
memorable way than we did in Section 29.1. It follows from (29.8) that 


X(t) = Re” cos(T — of) and = f(t) = Re™ sin(T — wt), 
where R = r(0) > 0 and T = 6(0) will be the arbitrary constants in our solution; using (29.4) we have 


x(t) = Re? [cos(T — wt)v, + sin(T — af)v2]. 
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Fig. 29.3. An unstable spiral, along with the graph of a sample solution against t 
(x(t) solid, y(t) dotted). Solutions oscillate, but the amplitude of the oscillations 
tends exponentially to infinity. 


Fig. 29.4. A centre; the phase portrait consists of closed curves around the origin. 
The solutions repeat periodically with period 27 /@, where w is the imaginary part 
of the eigenvalues, as shown on the right, where a sample solution (x (f) solid, y(t) 
dotted) is plotted against rt. 


29.3 The phase portrait for the original equation 


Note that in all three of these cases the réle played by the eigenvectors (rather 
than the eigenvalues) is not as clear as it was in the case when the eigenvalues 
were distinct real numbers. Indeed, ‘twisting’ any of these pictures will still leave 
them qualitatively the same (although generally circles will be deformed into el- 
lipses). You can therefore base your phase portrait for the original equation entirely 
on the eigenvalues and still obtain an accurate picture. We illustrate this for the 
example (27.5) that started this section. 
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Fig. 29.5. How to find out which way orbits rotate around the origin. 


Example 29.3 Draw the phase portrait for the equation 


k= ( = 5) x. (29.9) 


We have already found the eigenvalues of the matrix that occurs in this equation: 
they are 1 + 3i. So the origin is an unstable spiral. Since the imaginary part (which 
is what makes the trajectories “spin’ around the origin) is +i, you cannot just read 
off the direction in which the trajectories ‘rotate’. The best way is to concentrate 
on a line level with the origin, either x = 0 or y = 0, and by looking at y or x 
work out the direction in which the trajectories are going. If you look on x = 0 
with y > 0 then x = y > O and trajectories are moving to the right, so the rotation 
is clockwise; similarly on y = 0 with x > 0 we have y = —2x < 0, so the motion 
is downwards, which once again shows that trajectories are rotating clockwise. See 
Figure 29.5. 

It is now simple to draw the phase portrait, which is shown in Figure 29.6, along 
with the graphs of x(t) and y(t) for a sample solution. 


Exercises 


29.1 Draw the phase portrait for the equation dx/dt = Ax, when the eigenvalues (A+) and 
eigenvectors (74) of A are as follows. Also given is the sign of x when x = 0 and 
y>0. 

G) A+ = 1+ 3i with ny = C1, 241), x <0; 
Gi) Ax = £31 with ny = (1 £21, 1 F 31), x < 0; 


Exercises 293 


2000 


1500 


1000 


500 


0 


—500 


1000 
0 1 2 3 


Fig. 29.6. The phase portrait for equation (29.9), and the graph of a sample solu- 
tion plotted against ¢, with x(t) solid and y(t) dotted. 


(iii) Az = —2 bi with ny = (1 Fi, 340, 4 > 0; 
(iv) Az = —1 tiwith yn, = (1, +i), x > 0; 
(v) Az = 242i with ny = (43i, 5 $ 4i), x < 0; 
(vi) At = 543i with ny = (245i, Fi), x < 0; 
(vii) Ax = £7i with ny = (1 +i, —1 + 2i), x > 0; and 
(viii) AZ = —13 + 17i with nz. = (461 — 8, 4 $51), % > 0. 


29.2 Write down the general solution of the equation dx/dt = A when the eigenvalues 
(Az) and eigenvectors (7) of A are those in the previous exercise. 

29.3 For the following equations find the eigenvalues and eigenvectors of the matrix on 
the right-hand side, and hence find the coordinate transformation that will put the 
equations into their standard simple (canonical) form. Show that this transformation 
has the desired effect. 


(i) 
dx _ 0 -!il 
Ge NA at 
(ii) 
de: 6 f =-2.3 
ap \6:-4y 
(iii) 
dx —-11 -—2 ‘ 
de \ 13 -9)” 
and 
(iv) 
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29.4 (T) In the previous chapter we used the result that the eigenvectors corresponding to 


29.5 


distinct eigenvalues are linearly independent. Use this result to show that the real and 
imaginary parts of complex eigenvectors are linearly independent. 
(T) Following the same line of reasoning as in Exercise 28.5, show how to construct a 


matrix with a complex conjugate pair of eigenvalues 44 = op + iw and corresponding 


eigenvectors 74 = Vv; + iv2. Hence find the matrices with the following eigenvalues 
and eigenvectors: 
G) A+ = 3+ 3i with ny = (241, 1 Fi); 
(i) A+ = £31 with ny = (41, 3 + 21); and 
(ii) Ak = —2 tiwith ny = (1 +i, 1 Fi). 
(The M-file makematrix.m will do this for you. You could use this to check that 


the signs of x given in Exercise 29.1 are correct by finding the appropriate matrix A 
and then looking at x when x = 0 and y > 0.) 
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Yet more phase portraits: a repeated real eigenvalue 


We now treat the final case, in which the matrix A has a non-zero repeated real 
eigenvalue. There are two very different situations in which this can happen, and 
we will treat them separately. The case of a repeated eigenvalue zero is the subject 
of Exercise 30.5. 


30.1 A is a multiple of the identity: stars 
The first possibility is that A is a multiple of the identity, 


r(6 9) 


In this case the equation x = Ax decouples with no extra work required on our 
part, 


x =x y=dy. 


(In fact the equation would decouple in any coordinate system.) For such a matrix, 
for any vector v we have Av = Avy, and so all vectors are eigenvectors. 

The phase portrait is particularly simple. Since any vector is an eigenvector, 
x(t) = Ae”v is a solution for any V; in particular x(t) = e*’x(0) is a solution, and 
so solutions move on lines emanating from the origin. Depending on the sign of A 
we have a stable or unstable star; the phase portrait for the stable case is shown in 
Figure 30.1. 


30.2 A is not a multiple of the identity: improper nodes 


When there is just one eigenvalue and the matrix is not a multiple of the identity 
then things are more difficult. In this case we will only be able to find one eigen- 
vector. Even though we only have one eigenvector it is still possible to find a new 
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Fig. 30.1. A stable star. 


coordinate system in which the matrix takes a ‘canonical form’, although the ar- 
gument is a little subtle. 

We will suppose that the eigenvalue is 4 and that the corresponding eigenvector 
is v. To begin, choose any vector v2 that is not in the same direction as v. Since 
these two vectors are linearly independent they form a basis for R*, and we can 
write any vector x as a linear combination of v2 and v. 

We now show that vj = (A — Al)vz is in the same direction as V, i.e. is an eigen- 
vector. To see this,! we write vj as a combination of the vectors v2 and v, 


(A — ADvo = avo + Bv, 


and apply A — AI to both sides. The term on the right-hand side involving v van- 
ishes (since v is an eigenvector with eigenvalue 4) and we obtain 


(A —AD[(A — ADv2] = a[(A — Alva], 
or 
(A — ADv; = avy. 


But this says that Av; = (A + @)vj, ie. that v; is an eigenvector with eigenvalue 
A. +a. Since there is only one eigenvalue we must have a = 0 and v; must be an 
eigenvector lying in the direction of v. Note that since v; = (A — All)v2 we have 


Avo = vj + Avo. (30.1) 


! « much more elegant, but less elementary, approach is to use the Cayley—Hamilton Theorem, see Exercises 30.2 
and 30.3. 
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We now refer our coordinates to the axes v; and v2, 
X = XVj + Yvo, 


and then, using (30.1) in the second line, 


d 
= =[v) vo} Aly vo]x 


= [v1 val | [Av vi + Avol® 


Xn 1\L 
=[v1 v2] ‘Ivy val (4 ae 


In the new coordinates our original equation becomes 


dx A 1\. 
—= : 2 
= (5 a)? 02) 
Writing this as a coupled system gives 
x = AE ~ 
ae (30.3) 
yay. 


Note that although the equations have not completely decoupled, we can solve 
the ¥ equation on its own; its solution is f(t) = Ae”. Substituting this for ¥(t) in 
the equation for x, 


ag 
Lee wet 
dt 


Xr 


This is a linear equation, and using the integrating factor e“’ we have 


Se) = A, 
Integrating between 0 and t we obtain e~* x(t) = At + B, and finally 
X(t) = Be“ + Ate”. 
This means that the solution in the original variables is 
x(t) = [Ate™ + Be ]v, + Ae vo, (30.4) 


where to use this formula you need to remember that v; = (A — All)v2. You are 
probably better off using the ‘second order equation’ method of Chapter 26 if you 
want to find an explicit solution. 

Drawing the phase diagram in this case is a little more difficult than before, 
even with the exact expressions for the solutions. You can get some idea of how 
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Fig. 30.2. The phase portrait for an improper node for A > 0. On the dotted line 
dx/dt = 0. 


the picture should look as follows; we assume here that A is positive. Note first that 
dy/dt is positive when y is positive, and negative when y is negative; also, dx /dt 
is positive while y > —Ax, and is negative when y < —Ax. Trajectories cross the 
line y = —Ax vertically. You should be able to put these ingredients together to get 
something like the collection of ‘S’ shaped trajectories shown in the phase portrait 
of Figure 30.2. The stability of the origin, referred to in this case as an improper 
node, depends on whether A is positive or negative. 


Example 30.1 Draw the phase portrait for the equation 


dx 5 -4 
7 é i )x (30.5) 


We found the (repeated) eigenvalue 2 = 3 of the matrix (; a) above (see 


Example 27.4), and its solitary eigenvector (2, 1). To draw the phase portrait we 
first draw the eigenvector and label it with arrows moving away from the origin 
(since the eigenvalue is positive). To work out whether the trajectories move on 
a collection of ‘forwards’ or ‘backwards’ S shapes we can use the same method 
as we did for the rotating cases; above the stationary point (x = 0 and y > 0) 
we have x = 5x — 4y = —4y <0 and so the trajectories are moving left, see 
Figure 30.3. 
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-1 -0.5 0 0.5 1 


Fig. 30.3. On the left are the steps for drawing the phase portrait for equation 
(30.5). On the dotted line x = 5x — 4y = 0; when y > 0 and x = 0 we have x < 
0. The phase portrait is on the right. 


Exercises 


30.1 Find the eigenvalue and eigenvector of the matrices occurring in the following equa- 
tions, and hence draw the phase portrait. Find also the coordinate transformation that 
will put the equation into canonical form and show that this works. Write down the 
general solution. 


(i) 


(ii) 
dx —6 2 
di k=2- ao) 
(iii) 
dx _ —-3 -!1 
FV We ae 
(iv) 
dx (13 0 7 
de \O 13) 
and 


(v) 
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30.2 (T) The characteristic equation for a 2 x 2 matrix 
a b 
A= 
(ao 


k* — (a+ d)k + (ad — bc) = 0. 


is |A — kl| = 0, ie. 


By explicit calculation show that A satisfies its own characteristic equation, i.e. that 
A? — (a+ d)A + (ad — bc)1 = O, 


where O is the 2 x 2 matrix of zeros.This is a particular case of the Cayley-Hamilton 
Theorem. 

30.3 (T) If A has a repeated eigenvalue 4 with eigenvector v then its characteristic equation 
can be written 


(k —d)? =0. 
Use the Cayley-Hamilton Theorem from the previous exercise to deduce that 
(A — Al)? = O, 


and hence that (A — i1)x is an eigenvector of A for any choice of non-zero vector 
xv. 

30.4 (T) By following the ideas of Exercise 28.5, show how to construct a matrix with 
a single eigenvalue 4 and corresponding eigenvector v. (There will be many such 
matrices.) Find two matrices with eigenvalue —1 and eigenvector (1, 1). 

30.5 (T) Suppose that the matrix A has zero as a repeated eigenvalue, with eigenvector 
v. Then we can change to coordinates referred to v2 and vj = Avz2, where v2 is any 
vector in a different direction to v, so that k = xv, + yv2. The equation becomes 


and so 


(i) Solve the equations for x(t) and y(t), and hence write down the general solution 
for x(t). 
(ii) Draw the phase diagram in the (x, y) plane, and hence in the (x, y) plane. 
(iii) Draw the phase diagram for the equation 


dx | —-1 1 
ae Vet a 
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Summary of phase portraits for linear equations 


When confronted with an example dx/dt = Ax you should first calculate the 
eigenvalues of A, and then if they are real calculate the eigenvectors. With this 
information you can draw the phase portrait. The various possibilities are sum- 
marised below, and illustrated in Figure 31.1. 


(i) Distinct real eigenvalues 
© Ay < Az < 0 gives a stable node: all trajectories approach the origin, tangent to the 
eigenvector corresponding to i2. 
° A, > Az > O gives an unstable node: all trajectories move away from the origin, 
tangent to the eigenvector corresponding to A2. 
° A, <0 < Az gives a saddle: the only trajectories to approach the origin are those 
starting on the ‘stable eigenvector’, while all other trajectories move away. 
(ii) Complex conjugate eigenvalues p =: iw (to find the direction of rotation you need to 
check, for example, the sign of x on the line x = 0) 
¢ p <0 gives a stable spiral: all trajectories spiral into the origin. 
¢ p > 0 gives an unstable spiral: all trajectories spiral out from the origin. 
¢ p=0,ie. A = ia, gives a centre: trajectories close and we have a family of peri- 
odic orbits. 
(iii) A repeated real eigenvalue 
¢ The matrix is a multiple of the identity: we have a stable or unstable star depending 
on the sign of the eigenvalue (A < 0 gives stability, which is the case shown in 
Figure 31.1). 
¢ The matrix is not a multiple of the identity: we get the S-shaped phase portrait of an 
improper node, whose stability depends on the sign of A (stable for 1 < 0). 


31.1 *Jordan canonical form 


The coordinate transformations that we have used in the previous three chapters 
are those that put the matrix A into its Jordan canonical form. In order to ex- 
plain this, we first need to discuss the relationship between matrices and linear 
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Stable node Unstable node Saddle point 
Stable spiral Unstable spiral Centre 
Stable star Stable improper node Unstable improper node 


Fig. 31.1. Sample phase portraits listed by the stability type of the stationary 
point: (i) distinct real eigenvalues on the first row, (ii) complex eigenvalues on the 
second row, (iii) equal real eigenvalues of the third row. 


transformations. We will only do this for the case of linear transformations of the 
plane. 


31.1.1 Representation of vectors in different coordinate systems 


A vector x lying in the plane can be specified without reference to a coordinate 
system. For each particular choice of coordinate axes, x will be represented by a 
different pair of coordinates. For example, if x has coordinates (x, y) with respect 
to the coordinate axes e; and e2 this means that 


X= xe, + yeo, 
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while with a different choice of axes, v; and v2, say, the same vector would have 
coordinates (x, y), i.e. 


X = Xv, + yvo. 


Writing both expressions for x in matrix form we have 


[e; e2] (*) =[v vo] O) : 
y y 


which enables us to transform between the two different coordinate systems, 


(JG) = G-eG) 


where S = [e; e2]~![v,_ vo]. 


31.1.2 Linear transformations of the plane and 2 x 2 matrices 


A transformation of the plane L : x +> L[x] is linear if 
Llax + By] = aL[x] + BLly] 
for all x, y € R? and all a, B €R. In particular, this means that 
L[x] = L[xe; + ye2] = xL[e;] + yL[ez], 


and so in order to work out L[x] we only need to know L[e;] and L[e2]. 
For each choice of coordinates axes we can find a matrix A such that if x has 
coordinates (x, y), then L[x] has coordinates 


(>): 


For example, if we work in the ej—e2 coordinate system, in which (x, y) represents 
the point x = xe; + yep, then if 


Lie;] = a1, e) + a21e2, Lie2] = ay2e) + a22e2, 


and we set 


a a 
he 11 12 ; 
a21 422 
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A ) = ee + a) 
y a21x + a22y 
-+(ci)e9(2) 
a2\ a2 


which is how we write x L[e;] + yL[e2] = L[x] in the ej—e2 coordinate system. 
If we were to change the coordinate axes then the matrix representing the trans- 
formation L would have to change too. To find the new matrix, suppose that 
the point x has coordinates (x, y) with respect to the axes vj and v2, so that 
X = XV, + Yv2. Then its coordinates (x, y) with respect to the axes e; and e2 are 


y y 


(see (31.1)). We know that in the e;—e2 coordinate system, the coordinates of L[x] 
can be found by multiplying by A, and so are 


as(7). 
y 
Using (31.1) again the coordinates of the vector L[x] in the vj—v2 coordinate sys- 
tem are 
s'as(7). 
y 


The upshot of this is that the transformation L is represented in the new coordinate 
system by the matrix 


we have 


B=S7'!AS. 


31.1.3 Similar matrices and the Jordan canonical form 


We have seen that if two matrices A and B represent the same linear transformation 
with respect to different coordinate axes then for some non-singular matrix S we 
have 


B= S—'AS. 


Such matrices are said to be similar. 
It is a natural question whether there is a particular choice of matrix S that will 
find the ‘simplest’ (in some way) matrix that is similar to a given matrix A. In other 


Exercises 305 


words, whether there is a ‘natural’ coordinate system in which the linear transfor- 
mation L can be easily expressed. 

There is such a form, known as the Jordan canonical form. All similar matrices 
have the same eigenvalues, and the simplest matrix depends on these eigenvalues. 
It is always one of the following possibilities, which should by now be familiar. If 
A has distinct real eigenvalues A; and A2 then the Jordan canonical form is 


isp, 20%, 
0 dag? 


if A has a complex conjugate pair of eigenvalues p + iw then the canonical form 
is 


and if A has only one eigenvalue 4 then the canonical form is either 


(oa) * Ga) 


These possibilities follow from an analysis that closely parallels that of the previ- 
ous three chapters. 


Exercises 


31.1 Draw the phase portrait for the equation dx/dt = Ax when the eigenvalues and eigen- 
vectors of A are the following: 
(i) Ay = 3 with vj = (1, 1) and Az = —2 with v2 = (1, —2); 
(ii) complex conjugate eigenvalues A+ = —1 + 31, with x <0 when x = 0 and 


y > 0; 

(iii) a single eigenvalue A = 13 with eigenvector (3, 2), and x > 0 when x = 0 and 
y > 0; 

(iv) Ay = —2 with v; = (2, 1) and Az = —3 with v2 = (1, —1); 

(v) a single eigenvalue A = —3 with eigenvector (1, —1), and x > 0 when x = 0 
and y > 0; 


(vi) A = +21, where y < 0 when y = 0 and x > 0; 
(vii) Ay = 1 with v; = (3, 2) and Az = 5 with v2 = (1, —4); 
(viii) A = 5 +i, and y > 0 when y = 0 and x > 0; and 
(ix) a single eigenvalue 4 = —7, with the matrix A a multiple of the identity. 


Part VI 


Coupled nonlinear equations 
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Coupled nonlinear equations 


We now turn our attention to coupled nonlinear systems. We will concentrate on 
autonomous systems in which the right-hand side does not depend explicitly on 
time, 


dx /dt = f(x, y) 


32.1 
dy /dt = g(x, y). ee 


Using the vector notation x = (x, y) and f(x) = (f(x, y), g(x, y)), this equation 
can be rewritten 


dx 
apo f(x). 

Our approach will be to try to understand the dynamics of these equations (the 
behaviour of their solutions) in a qualitative way by drawing the phase diagram 
in the (x, y) plane (‘the phase plane’), just as we have done for linear equations 
in the past three chapters. We will find that we can piece together the phase por- 
trait for nonlinear systems from a collection of phase portraits for linear (or near- 
ly linear) systems near the stationary points. 


32.1 Some comments on phase portraits 


A stationary point is a point (x*, y*) at which x = y = 0, i.e. where 
fa*,y)=0 and g(x*,y") =0. 


Because solutions are unique, it follows that if (x*, y*) is a stationary point then 
solutions starting at (x*, y*) remain there for all time. The phase portraits we drew 
in the previous chapters were fairly simple, since we only ever had a single sta- 
tionary point at the origin. 
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Fig. 32.1. Trajectories cannot cross; xo must be a stationary point. 


Our phase portraits will show the stationary points marked by crosses, and in- 
clude a representative collection of trajectories (the curves traced out by solutions 
as they change in time) with the direction of increasing time indicated by an arrow. 
As with the phase diagrams we drew in Chapter 7 we lose any information about 
the rates at which the solutions change, but may be able to understand their be- 
haviour much more easily from the diagram than we would from an explicit solu- 
tion. 

An important point to notice is that curves in the phase diagram cannot cross 
one another; this is a consequence of the uniqueness of solutions. If we had a 
situation like that illustrated in Figure 32.1, with two curves emanating from a 
point xo, then there would be two solutions starting at xo; this cannot happen, 
since solutions are unique. The only way that we can get the kind of situation 
pictured is if xo is a stationary point. If this is the case there is no contradiction, 
since although the trajectories meet at xo, they do not actually pass through it. (If 
x(s) = Xo for some s, where Xo is a stationary point, then we must have x(t) = xg 
for all ¢.) In all our phase diagrams for linear equations (see Figure 31.1) there 
are apparent ‘crossings’; but they all occur at the origin, which is a stationary 
point. 


32.2 Competition of species 


We will illustrate the general method by considering a simple ecological model for 
two species that are competing for the same resources, e.g. a herd of sheep and 
cows grazing over the same fields. If we denote the numbers of the two species 
(measured, let us suppose, in hundreds) as x(t) and y(t) then in isolation we might 
expect the size of both populations to obey the logistic equations 


x=x(A—ax) and y=y(B—dy) (32.2) 
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x x 


Fig. 32.2. The phase portrait for either species in isolation; the left stationary 
point represents a population of zero, the right-hand one the equilibrium value 
(x = A/a or y= B/d). 


(with A, B, a, and d positive), cf. Section 8.5. In particular these equations predict 
that left to themselves each species would settle down to a constant population 
(for the first, x = A/a, and for the second, y = B/d as can be seen from the phase 
diagram in Figure 32.2). 

However, since there are limited resources both species will be disadvantaged 
by the presence of the other. So we would expect the model 


x=x(A -ax -—by ) 
See 
y inhibits x 
va) ae = 
y=y( cx dy) 
x inhibits y 
with b and c positive to reflect this. 
In this chapter we will consider these equation with a particular choice of the 
parameters A, B,a, b,c, and d, 
x =x(8 —4x —y) 
y = yGB— 3x — y). 
For these models we are only interested in the behaviour of the solutions for 


x, y => 0, since we want our populations to be positive. (It should be clear, however, 
that the equations are mathematically sensible for any values of x and y.) 


(32.3) 


32.3 Direction fields 


Given a coupled pair of nonlinear equations (similar to (32.3)) we can get a very 
good indication of what the phase portrait should look like by drawing the direction 
field. This is a set of arrows pointing in the direction of the vector x and whose 
length is proportional to the magnitude of x. The direction field shows the direction 
in which solutions move, and how fast. The curves traced out by solutions will be 
everywhere tangential to the direction field, since the field shows how they are 
moving instantaneously, see Figure 32.3. 

The direction field for equation (32.3) is shown in Figure 32.4, which was pro- 
duced using MATLAB’s quiver command. 


>> [x, yl] = meshgrid(0:1/3:2.5, 0:1/3:3.5); 
>> xd = x.*(8-4*x-y); yd= y.*(3-3*x-y); 
>> quiver (x,y,xd,yd) 


312 32 Coupled nonlinear equations 


> 
gees ——s 
\ 
Me ’ ‘ 
‘A 
\ Y a 
J 
Rn “Pa a 
a we 


Fig. 32.3. A fanciful trajectory shown as a dotted line, and some arrows from 
the direction field tangential to it. The solution moves faster along the trajectory 
where the arrows are larger. 
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Fig. 32.4. The direction field for equation (32.3). 


Having drawn the direction field, it should be relatively easy to ‘join the dots’ 
and draw the phase portrait as in Figure 32.5. 

Of course, in practice it is not really convenient to draw so many arrows of the 
direction field unless you have access to a computer package. However, a related 
approach that can be useful more generally is to draw the ‘nullclines’. These are 
the lines (or curves) on which x or y is equal to zero. For our example x = 0 when 
x =0 or when 8 — 4x — y = 0, and y is zero when y = 0 or 3—3x —y = 0. 
These nullclines are shown in Figure 32.6, along with some sample trajectories. 
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0K 0.5 1 1.5 2 2.5 


Fig. 32.5. The phase portrait for equation (32.3). 


os 
Fig. 32.6. The ‘nullclines’ for equation (32.3). The dashed line is 8 — 4x — y = 
0, on which x = 0, and the dotted line is 3 — 3x — y = 0, on which y = 0. So, 


from left to right, we have regions where x and y are both increasing; where x 
increases but y decreases; and where x and y are both decreasing. 
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With some thought (and a lot of imagination) it might be possible to draw the phase 
portrait if you know where the nullclines are. (We used the nullclines to draw the 
phase portrait for the linear system with a repeated real eigenvalue in Section 30.2.) 

However, we will concentrate here on the analytical method that enables us to 
draw qualitatively accurate ‘sketches’ of the phase diagram using our knowledge 
of linear systems. 


32.4 Analytical method for phase portraits 
The method has four stages: 


¢ find all the stationary points, 

¢ linearise near the stationary points, 

¢ draw the phase portrait near the stationary points, and then 

¢ join up these ‘local’ phase portraits to give the global picture. 


32.4.1 Step 1: find the stationary points 


Recall that the stationary points are those points at which x and y do not 
change, i.e. those points (x*, y*) such that f(x*, y*) = g(x*, y*) =0. For 
equations (32.3) we therefore need (x*, y*) to satisfy 


x*(8 — 4x* — y*) =0 and y*(3 — 3x* — y*) =0. 


We can satisfy the first equation if we choose x* = 0. The second equation is then 
satisfied if either y* = 0 or y* = 3. The choice y* = 0 for the second equation also 
allows x* = 2 as a solution of the first equation. A final possibility is provided by 
the solution of the simultaneous equations 


8 —4x* — y* =0 and 3 —3x*-—y* =0 


gives x* = 5 and y* = —12. Since we are only interested in stationary points with 
x* and y* non-negative, we can concentrate on the three stationary points 


(0, 0), (2, 0) and (0, 3). 


32.4.2 Step 2: linearise near the stationary points 


The next step is to determine the stability of the stationary points by linearising. 
The idea, essentially, is to look at what happens ‘near to’ the stationary points. 

To do this we suppose that x(t) is close to a stationary point x* = (x*, y*), and 
write 


x(t)=x" +E) and y(t) = y* + n (0), (32.4) 
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A 


(x*+5,y"*47) 


(x*y’) g 


Fig. 32.7. Choosing new coordinates € = x — x* and n = y — y* near a station- 
ary point (x*, y*). 


where &(t) and n(t) are small, see Figure 32.7. The new coordinates (&, 7) treat 
(x*, y*) as the origin. 

We now write down the equations satisfied by &(t) and y(t). Since x* and y* 
are constants, it follows from differentiating the two equations in (32.4) that 


xX=E and y=n. 


Therefore 


PSS (ES ie See eH) 
NSV= 26,9) =o KE a): 


Now we use the Taylor expansion of f and g about (x*, y*). Recall (or see 
Appendix C) that the Taylor expansion of a function f of two variables is given by 


fat+éytn=ft feE+ font 5 fexe? + feyEn + 5 fyyn? ++ 


where f, = 0f/0x, etc., and all the partial derivatives on the right-hand side are 
evaluated at the point (x, y). Then 


af af 
E= f(x", y*)+ age 48 aa ay te 
. 0g ag 

n= a3", y*)+ ae y yet ay PAE 


‘ > 


where the ‘---’ are terms of higher order in € and n. If € and n are sufficiently 
small we would expect that we can ignore the higher order terms and still have a 


good approximation of the rate of change of &(t) and n(f). 
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Since x* is a stationary point we have f (x*, y*) = g(x*, y*) =0, and so 
af af 


ae .y ee a syn 


: 0g dg 
A= Oy VE Oy) ®- 
Ox oy 


Notice that this is a linear equation which we can rewrite in matrix form as 
> Ce es e 
|= : (32.5) 
i dg/dx dg/dy n 


Alternatively, with € = (€, 7) and using the compact notation 


lor, y*) 


af/ax daf/ ?) 
Df x* => ( ’ 
ee) dg/dx dg/dy eas 
this becomes 
dg 
— = Df(x*) é. 
ar (x")€ 


This equation is known as the ‘linearisation’ of (32.1) about the stationary point 
(x*, y*), since by considering only solutions ‘sufficiently close’ to (x*, y*) we 
have approximated the original nonlinear equation by a linear equation. From the 
work we did in earlier chapters on linear equations we can understand the be- 
haviour of this equation using only the eigenvalues and eigenvectors of the matrix 


Of /dx ae 


ee Gs ag/ay 


loo y*) 


For our example, where 


f(@,y) =x(8—4x —y) and g(x,y) = y3 —3x —y), 


we have 


={8—38%=—% —x 
Dts. ») = ( aby, 2) (32.6) 


32.4.3 The Hartman—Grobman Theorem 


It can be proved (but is well beyond the scope of this book) that the phase portrait 
of the original nonlinear problem ‘sufficiently close’ to a stationary point looks 
‘essentially the same’ as that of the linear equation in (32.5) provided that the 
eigenvalues have non-zero real part. This result, known as the Hartman—Grobman 
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Fig. 32.8. The left-hand picture shows the phase portrait for the linearised equa- 
tion close to the origin; the right-hand picture shows the phase portrait of the 
nonlinear equation close to the origin. The two pictures are indistinguishable. 
(Despite appearances this is an honest figure, with one picture produced using the 
linearisation and one using the full nonlinear equation!) 


Theorem, gives the mathematical foundation for the whole technique that we will 
be using. 
Figure 32.8 shows some solutions of 


x =x(8 —4x — y) 
yo Vise Se = y) 


near the origin, along with some solutions of the linearised equations 


The two pictures appear to be identical. 

The condition that the eigenvalues have non-zero real part means that the sta- 
tionary point cannot be a centre. Indeed, Exercise 32.1 shows that nonlinear terms 
can turn a linearised centre into a stable or unstable focus. However, examiners 
and problem setters have a habit of finding nonlinear systems in which stationary 
points that are centres for the linearised equation do indeed sit in the centre of a 
family of periodic orbits in the nonlinear problem. In all the examples below we 
will check carefully that our linearised centres correspond to centres of the nonlin- 
ear equation. 

The concepts of the stable and unstable manifolds, which perhaps seemed some- 
what artificial when they were introduced in Section 28.5, are much more useful 
when dealing with nonlinear equations. With essentially the same definitions as 
before, the stable manifold of a stationary point x*, W*(x*), consists of all those 
points lying on trajectories that tend to x* as t — oo; while the unstable mani- 
fold W"(x*) consists of all those points lying on trajectories that would tend to x* 
were the direction of time reversed. The stable and unstable manifolds of a saddle 
point in a nonlinear system are tangential to the eigenvectors corresponding to the 
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Fig. 32.9. The stable and unstable manifolds of a stationary point are tangential 
to the eigenvectors: W"(x*) is tangent to the unstable eigenvector (dotted line), 
and WS(x*) is tangent to the stable eigenvector (dashed line). 


negative and positive eigenvalues of the linearisation; once again a proof is beyond 
the scope of this book. This is illustrated in Figure 32.9. 


32.4.4 Step 3: find the stability type of each stationary point 


The next step is to work out the ‘stability type’ (stable node, saddle, unstable focus, 
etc.) of each stationary point. Since the phase portrait ‘close’ to each stationary 
point looks like the phase portrait of the linearised system, we can use this to draw 
the phase portrait near each stationary point. 

For our example, the matrix of partial derivatives was given in (32.6), 


prs» = (FB? —x ). 


—3y 3 —3x —2y 


Near (0, 0) we set € = x and n = y, and so the linearisation is 


dé (8 0 
7=(6 Je 


We can just read off the eigenvalues of this matrix (see Example 27.2); they are 
A = 8 and A = 3, and so this is an unstable node. The eigenvector corresponding to 
A = 8 is (1, 0) (the x-axis), and that corresponding to 4 = 3 is (0, 1) (the y-axis). 
So the linearised phase portrait near the origin looks like Figure 32.10, where the 
trajectories for negative values of x and y (which are not of interest given the 
application of the model) are shown as dotted lines. 
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Fig. 32.10. The linearised phase portrait near (0, 0). 


Fig. 32.11. The linearised phase portrait near (2, 0). 


Near the stationary point at (2, 0) we set = x — 2 and = y and the linearised 


system is 
dg —-8 -2 
a ( 0 =) é: 


The eigenvalues of the matrix we can read off as Ay = —8 and Az = —3 (see 
Example 27.3), and so this point is a stable node. The eigenvector corresponding 
to A; = —8 is just v; = (1, 0) (the x-axis), while that corresponding to A2 = —3 
has to be found from 


and so is v2 = (2, —5). The linearised phase portrait near (2,0) is shown in 
Figure 32.11, where once again the ‘uninteresting’ orbits with y < 0 are shown 
as dotted lines. 
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rs 


Fig. 32.12. The linearised phase portrait near (0, 3). 


Finally we investigate the linearised system near the stationary point on the y- 
axis, (0, 3), where we set € = x and 7n = y — 3. The linearisation is given by 


dé (5 0 
dr @ =e 


Once again we can just read off the eigenvalues, which are 4; = 5 and Az = —3; 
this stationary point is a saddle. The eigenvector corresponding to A, = 5 (‘the 
unstable direction’), vj = (v1, v2), is determined by 


(2 %)(*)=6 


one choice would be v; = (8, —9). The ‘stable eigenvector’ corresponding to 
A2 = —3 is V2 = (0, 1) (the y-axis). This stationary point is a saddle, and the 
linearised phase portrait nearby is shown in Figure 32.12. 

Plotting these three ‘local’ phase portraits on the region x, y > O gives the par- 
tial phase portrait shown in Figure 32.13. 


32.4.5 Step 4: Join the dots’ 


Now we want to join up the local portraits to give the global picture; the principle 
is to join up the local phase portraits in a consistent way. The full phase portrait 
is shown in Figure 32.14. (We saw this before at the beginning of the chapter, but 
now we can draw it without recourse to a computer-generated plot of the direction 
field.) 

It should be easy to read the fate of our two species from the phase portrait. The 
point at (2, 0) is globally attracting; no matter where a trajectory begins (unless it 
is on one of the axes) it tends towards this point. So whatever the initial balance 
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Fig. 32.13. The local phase portraits plotted together. 
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Fig. 32.14. The global phase portrait for equation (32.3). 
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between the species, species ‘x’ always wins out over species “y’, which eventually 
dies out. 

In the next chapter we continue to look at ecological models using these phase 
plane methods, and show that with different choices of parameters the outcome in 
this competitive situation can be much happier, with both species able to coexist. 
We will also look at what happens when one of the species forms the prey of the 
other. 


Exercises 


32.1 The Hartman—Grobman Theorem guarantees that the phase portrait for a nonlinear 
equation looks like the linearised phase portrait sufficiently close to a stationary point 
provided that the eigenvalues have non-zero real part. In particular, the linearised 
system may not give a qualitatively correct picture when the linearised equation pro- 
duces a centre, as this example demonstrates. First show that the origin is a centre for 
the linearised version of the equation 


% = —y+Ax(x* + y’) 
yoxtay* + y?). 
Now write down the equation satisfied by 7, where 


r=x?+y’, 


and hence show that the stability of the origin depends on the sign of 4. Draw the 
phase portrait for 7 < 0. 
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Ecological models 


In this chapter we first investigate what other types of behaviour can arise in mod- 
els of competitive species, and then we consider the more aggressive situation in 
which one species preys on the other. The simple models that we treat here are 
known as Lotka—Volterra systems. 


33.1 Competing species 


It is possible to treat the general model for competing species 


x = x(A —ax — by) 


; 4 
y = y(B—cx — dy), een) 


see Exercise 33.3. However, the general treatment is much less illuminating than 
considering particular examples, and here we deal with two cases that have be- 
haviour which is significantly different from that of the previous chapter. 


33.1.1 Weak competition 


First we consider the example, 


x =x(4—2x —2y) 
y = y(9 — 6x — 3y), 


for which there are four non-negative stationary points (where the right-hand sides 
are zero): if x = 0 then we could have y = 0 or y = 3; if y = 0 then we could 
have the additional stationary point that arises when x = 2; and finally there is an 
interior stationary point when x = y = 1, corresponding to a coexistent state in 
which there are an equal number of both species. So the four possibilities are 


(33.2) 


(0, 0), (2,0), (0, 3) and (1, 1). 
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Fig. 33.1. The linearised phase portrait near (0, 0). 


With f(x, y) = (x(4 — 2x — 2y), yQ — 6x — 3y)) we have 
_ {4-—4x -2y —2x 
ptcx.») = ( —6y ane) 


We now look at the linearisation about the four stationary points above. Near 
the origin we have 
dgé_ (4 0 é 
dt \O 9)” 


so the eigenvalues are 4 and 9, corresponding to the x- and y-axes respectively. As 
before, the origin is an unstable node, as shown in Figure 33.1. 
Near the stationary point on the x-axis, (2, 0), we have 


dé _({-4 —4 é 
dt \O -3/)” 
so the eigenvalues are A; = —4 and Az = —3. The point (2, 0) is a stable node. 


While the eigenvector corresponding to 4 = —4 is easily seen to be v; = (1, 0) 
(it lies along the x-axis), for the other we need to find v2 = (v1, v2) that satisfies 


(oo )(u)=e 


and so v2 = (—4, 1). The local phase portrait is shown in Figure 33.2. 
The stationary point (0, 3) on the y-axis will also turn out to be a stable node. 
Indeed, the linearised system about this point is 


de: i 25+. 
e= (38 “)e 
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LE 


Fig. 33.2. The linearised phase portrait near (2, 0). 


| 


Fig. 33.3. The linearised phase portrait near (0, 3). 


and so the eigenvalues are —2 and —9, both negative, and this is another stable 


node. The eigenvector v; = (v1, v2) corresponding to 4; = —2 is determined by 
0 0 
Ge = (v1) v2, 
and so one choice is (7, —18); the eigenvector corresponding to Az = —9 can 


easily be seen to be v2 = (0, 1), i.e. along the y-axis. Figure 33.3 shows the local 
phase portrait. 
Finally, near the interior stationary point (1, 1) the linearisation is 


dee > = 
2=(% “3 )¢ 


The eigenvalues of the matrix are the solutions A of the characteristic equation 


—2—-2z —2 


a eee = 5 = 
6 gg | = (AT MC3-AF12=1 +54 -6 


= 46-1) =0, 
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Fig. 33.4. The linearised phase portrait near (1, 1). 


and so are A; = 1 and Az = —6; this stationary point is a saddle. The eigenvec- 
tor Vv; = (v1, v2) corresponding to the unstable direction (A; = 1) is determined 
by 


and so is vj = (2, —3); while that corresponding to the stable direction (Az = —6) 
can be found from 


and is v2 = (1, 2). The local saddle point behaviour is shown in Figure 33.4. 

Figure 33.5 shows the local phase portraits near the stationary points plotted 
together. The full phase portrait is shown in Figure 33.6, and is consistent with the 
local patches we drew in Figure 33.5. 

What you can see here is that the stable manifold of the interior stationary point 
(the bold line in the figure) separates two regions of behaviour. Above this ‘sepa- 
ratrix’ all trajectories are attracted to the stationary point (0, 3) which lies on the 
y-axis: species x dies out, and species y settles down to a steady population. Below 
the stable manifold all trajectories tend to (2, 0); species y dies out, and species x 
stabilises at a constant value. Only for very special initial conditions, those that lie 
precisely on the stable manifold, will we end up with the two species coexisting in 
equal numbers at the interior stationary point (1, 1). Any small fluctuations away 
from this point will drive one of the species to extinction. 


33.1 Competing species 327 
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Fig. 33.5. The local phase portraits near the stationary points. 


0 0.5 1 1.5 2 2.5 


Fig. 33.6. The global phase portrait for equation (33.2). The ‘separatrix’ (the 
stable manifold of (1, 2)) is the bold curve. 


33.1.2 Coexistence 
As a final competitive model we will consider the system 


xX =x(4-—2x—-y) 


j= yQ— 3x —3y). ae 
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Again there are four stationary points; this time they are 
(0, 0), (2,0), (0,3) and (1,2). 


The matrix Df of partial derivatives is given by 


4—4x-y —Xx ) 


pics. ») = ( —3y 93x —6y 


Near the origin the linearisation is 


dé _ (4 0 é 

de \0 9)~ 
with eigenvalues 4 (corresponding to the x-axis) and 9 (corresponding to the 
y-axis); an unstable node. The local phase portrait is the same as that in the last 


example, shown in Figure 33.1. 
Near (0, 3) we put € = x, n = y — 3, and linearise to obtain 


abd 0, 
dt \-9 -9]* 
The eigenvalues of the matrix are 4; = 1 and A2 = —9; this stationary point is 


a saddle. The eigenvector vj = (v1, v2) corresponding to A; = 1 (the unstable 
direction) is determined by 


(“5 “io) (a) =* 


and so is (10, —9); while that corresponding to 42 = —9 (the stable direction) is 
just (O, 1). The local phase portrait is shown in Figure 33.7. 


Fig. 33.7. The linearised phase portrait near (0, 3). 
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Fig. 33.8. The linearised phase portrait near (2, 0). 


The linearisation about (2, 0) is 


dé maf =o eae é: 

de 0. S38 
the eigenvalues of this matrix are 4; = —4 with eigenvector v; = (1, 0), andA2 = 
3 with corresponding eigenvector determined by 


(oo )(a)=8 


1.€. V2 = (—2, 7). This is another saddle, with the local phase portrait shown in 
Figure 33.8. 
Finally, the linearised system near the interior stationary point (1, 2) is 


defo A 
t= (—3 a)é 


[where € = x — (1, 2)]. The eigenvalues of this matrix are the solutions 2 of 


S250: tea i ee = 
= Pano A)(-6 — A) —6=A74+ 81 4+6=0. 


Hence 


s+ /64—74 
ye SEV 244 V0 


Note first that both 4; = —4 — V10 and A2 = —4+ 10 are negative, so this sta- 
tionary point is a stable node. The eigenvector corresponding to A; is given by 


Cre im) ()=« 


and so is v; = (1, 2+ 10); while that corresponding to Az is given by 


Co5" 2s) (a) =8 
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Fig. 33.9. The linearised phase portrait near (1, 2). 


as 


Fig. 33.10. The local phase portraits near the stationary points. 


and so is v7 = (1, 2 — V/10). Since Az has the smaller modulus, trajectories ap- 
proach the interior fixed point tangent to the v2 direction; the local phase portrait 
near (1, 2) is shown in Figure 33.9. 

The local phase portraits are all combined on Figure 33.10, and the global phase 
portrait is shown in Figure 33.11. For this choice of parameters the interior station- 
ary point attracts all trajectories, and so any initial condition that includes some of 
both species will lead to a state of coexistence in which there are twice as many 
of species y as there are of species x. If there is only species x then it will settle 
to its own equilibrium x = 2 and if there is only species y then it will settle to 
y =3. 
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0 0.5 1 15 2 2.5 
Fig. 33.11. The global phase portrait. From any initial mix of species the solution 


converges to (1, 2). 


33.2 Predator-prey models I 


Another class of models governs two species when one is the prey of the other, 
e.g. hawks and rabbits. If x is the prey and y the predator, we would expect the prey 
to thrive happily without the predator, so in isolation it should obey the logistic 
equation 


x =rx(k —x). 


In contrast, if left to itself with no prey, and hence no food, species y would die 
out, 


y= Sy. 


Any interaction between the species now favours y (since they need x to eat) 
but clearly disadvantages x (who are eaten). So we end up with the model 


x =rx(k —x —ay) 
y = y(—s + bx), 


where all the parameters are positive. When b < ks there are only two stationary 
points with x, y > 0, and the predator will eventually die out, leaving the prey to 
settle down by itself to its natural equilibrium (x = k); this possibility is treated in 
Exercise 33.1 (vii). 
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We consider here an example that has b > ks and so exhibits more interesting 
behaviour, 


x=x(1—2x—-y) 
: 33.4 
y = y(—2 + 6x). aise 
The three stationary points here are 


0,0, (4,0) and (4,4). 


With f(x, y) = @(1 — 2x — y), y(—2 + 6x)) the matrix Df of partial deriva- 
tives is given by 


1—4x — —Xx 
Divx») = ( 6y ° a) 


Near the origin the linearisation is simply 


dé Lf Li VO é 
dg 0! ae 
so the origin is a saddle; the eigenvalues are 1, corresponding to the x-axis, 


i.e. the eigenvector (1, 0), and —2, corresponding to the y-axis, i.e. (0, 1). See 
Figure 33.12. 


Near G, 0) the linearisation gives 


dé (-1 -}3 
dt © fas 


and so the eigenvalues are A; = —1 and Az = 1 and this stationary point is a sad- 
dle. While the eigenvector corresponding to A; = —1 lies along the x-axis (it is 
bu 
| 
| 
V 
iN a 


Fig. 33.12. The linearised phase portrait near (0, 0). 
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Fig. 33.13. The linearised phase portrait near (5, 0). 


vi = (1, 0)), the eigenvector corresponding to 42 = 1 is determined by 


—2 —5 v1\ _9 
0 0 v2 ‘ 
So is V2 = (—1, 4). See Figure 33.13. 
The linearisation about the interior stationary point is 


2 1 
SN fg er é. 
dt 2 0 
The eigenvalues of this matrix are given by the solutions A of the characteristic 
equation 


and so the interior stationary point is a stable spiral. Trajectories therefore spiral 
in towards Gj. 3)3 to find out whether they spiral in clockwise or anti-clockwise, 
we look at the linearised equations near (3, i). The equation for 7 is 7 = 2&, so 
that for € > 0 (to the right of the stationary point, since x = 5 + &) trajectories 
are moving up, while for € < 0 (to the left of the stationary point) trajectories are 
moving down. The spiralling motion is therefore anti-clockwise (see Figure 33.14). 

The full phase portrait is shown in Figure 33.15. Since the interior stationary 
point is globally attracting, the predator and the prey settle down to a state in 
which there is enough prey to keep the predators alive and the predators never eat 
themselves to extinction. 
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Fig. 33.15. The global phase portrait for equation (33.4). 
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We now suppose that the prey, if left alone, would obey the exponential population 
model 


x = kx 


rather than the logistic model we used previously; this would be the case for a 
population (perhaps unrealistically) not limited by local resources. An example 
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might be whales and plankton. In this case, the equations become 
x =kx(1—ay) 
y = y(bx _ S), 


(with all coefficients positive) and we will analyse these in general. 
There are only two stationary points, at (0, 0) and at (s/b, 1/a). The matrix of 
partial derivatives of the right-hand side is 


kd—ay) —akx ) 


Df = 
(y) ( by bx —s 


Near the origin this gives the linearisation 
dg ke é 
dt \O -s})” 


and so the origin is a saddle, with the stable direction running along the y-axis 
and the unstable direction along the x-axis. About the interior stationary point the 


d€ (0 -—aks/b 
dt \b/a 0 é. 


linearisation is 


The eigenvalues A of the matrix satisfy 


—r —aks/b 2 
ee zy [=a +ks =0, 
and so A = tid/ks. 

The linearised flow near the interior stationary point suggests that it might be a 
centre, but recall that we can only guarantee that the phase portrait for the nonlinear 
equation looks like the linearised phase portrait when the eigenvalues have non- 
zero real part. So we need to do a little more work to check that there really are 
closed orbits around (s/b, 1/a). We can do this because it is possible to find the 
equation of the curves traced out by trajectories. 

Using the chain rule, if y(t) = y(x(¢)) then we have 


dy _dydx 
dt = dx dr’ 
from which it follows that when x 4 0 we have 


dy dy /dx 


dx  dt/ dt 
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(you can think of this heuristically as ‘cancelling the dts’). So therefore along a 


trajectory 


dy y_ y(bx —s) 


dx x  kx(1—ay) 


(cf. example in Section 8.6.2). This equation is separable, 


1—- bx — 
BN go Xx S aie: 
y kx 
and so 
1 b KY 
——ady=—-—— —dx. 
ia aes ay ae 


Integrating both sides gives (since x and y are positive) 


Iny — ay = (bx/k) — 7 Inx eye) 


cl =i sy bx 
=Iny-—ay+—-Inx — — 
x,y y —ay 7 x k 


is constant on trajectories. 

In order to understand the forms of curves of constant C, we can find the turning 
points of the function C(x, y) (see Appendix C for a brief discussion). These occur 
when 0C/dx = dC /dy = 0. Because 

aC s Db 0c 1 


—=—-- and — 
Ox kx =k oy y 


’ 


there is only one turning point at (s/b, 1/a), the interior stationary point. Calcu- 
lating the matrix of second partial derivatives 


a’C/ax* a°C/axdy\ — ( —s/kx? 0 

a*C/dyax 9?C/day* } 0 —1/y? }’ 
it is easy to see that both the eigenvalues of this matrix are negative (they are 
—s/kx* and —1/y7), and so C is a maximum at this stationary point. It follows 


that the curves of constant C are closed curves near the stationary point, and hence 
the trajectories form a collection of periodic orbits. 


Example 33.1 Draw the phase portrait for the equations 


4+=x(1—y) 
y= yx —4) 


and find the equations of the trajectories. 


(33.5) 
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Fig. 33.16. The phase portrait for equation (33.5). 


Using the analysis above, there are two fixed points, one at the origin (which 
is a saddle) and one at (2, 1) which is a centre. The phase portrait is shown in 
Figure 33.16. 

On the curves traced out by solutions we have 


dy  y_ yQx —4) 


dx <x yy 


Separating the variables, 


and integrating both sides gives 
Iny—y =2x —4Inx+C, 
so that 
C(x, y) =Iny + 4Inx — y — 2x 


is constant on trajectories. The level sets of this function C, produced by the 
MATLAB code 
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Fig. 33.17. Curves on which C(x, y) = Iny + 4Inx — y — 2x is constant. 


>> [x, y]=meshgrid(.1:.1:4.5, .1:.1:4.5); 
>> z=log(y)+4*log (x) -y-2*x; 
>> contour (x,y,z, [-2.3 -2.5 -3.5 -4 -4.5]); 


are shown in Figure 33.17 — note how they correspond to the trajectories in 
Figure 33.16. 


Exercises 


33.1 For each of the following models of two species, describe first the type of situation 
being modelled, then find the stationary points, determine their stability type and 
draw the phase portrait for x, y > 0. Finally, say what the phase portrait means for 
the two species. 


(i) 
K=x(2—x-Yy) 
y = y2—- 2y — 2x) 
(ii) 
x= x(2—x-y) 
y = y2—2y —x/4) 
(iii) 


x =x(2—x—-3y) 
y = y2—- 2y — 3x) 
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(av) 
x =x —2y) 
y = y(-2 + 3x), 
find also the equations of the curves along which the solutions move, 
(v) 
x =x(4—x/2—-3y) 
y=y(-2+%) 
(vi) 
x = x(710-—-x —3y) 
y=yU+x — 10y) 
(vii) 


x=xB-—x-y) 
y=y(-2+ x). 
(You could use the MATLAB program lotkaplane.m to help draw some of these 
phase portraits. It asks for the parameters that occur in the general form of the equa- 
tions 
x = A(x + ax + by) 
y = By tex + dy) 
and then draws the trajectory forwards and backwards through specified initial con- 
ditions.) 
33.2 The situation in which two species cooperate, so that the presence of one enhances 


the environment for the other, can be modelled by a coupled pair of equations of the 
form 


x = x(A-—ax+by) 
y = y(B + cx — dy), 


where all the parameters are positive. Draw the phase portraits for the following co- 
operative equations: 


(i) 
x=x1l-x+y) 
y=y+x-—2y), 


(ii) 


x=x(2-—x+y) 
y= y44+2x—y). 
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33.3 (T) Consider the general model of two competing species, 
x = x(A —ax — by) 
y = y(B —cx — dy), 


where all the parameters are positive. Assuming that the intercepts of the nullclines 
(lines on which x = 0 and y = 0) with the x- and y-axes do not coincide, by consid- 
ering the relative positions of these intercepts show that there are four distinct possi- 
bilities for the behaviour of solutions, and find the parameter ranges over which they 
occur. Check that your results are consistent with what you found for the competitive 
examples in Exercise 33.1. 
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Newtonian dynamics 


In this chapter we apply phase plane ideas to various one-dimensional systems 
that model a particle moving under Newton’s laws of motion. First we consider 
systems in which the energy is constant, and then we consider systems in which 
there is some dissipation. 


34.1 One-dimensional conservative systems 


We consider a particle of mass m moving on a line in a potential force field, such 


that its potential energy at position x is given by V(x). Then its kinetic energy is 


i eee : ; 
zmx~, and its total energy is 


E = 5mx? + V(x). (34.1) 
If the energy is conserved then we can differentiate to give 
mxx + V'(x)x = 0; 
provided that x 4 0 we can cancel this term and obtain! 
mx = —V'(x). (34.2) 
By setting y = x, we can rewrite this as the coupled system 


x=y 


aie. (34.3) 


In all that follows we will take m = 1 for simplicity. 
If you think of these equations as describing the motion of a bead sliding on 
a wire whose height at coordinate x is given by V(x) you will get the correct 


1 By assuming the continuity of the function x(t) and its derivatives it is possible to justify this equation for all r, 
even at those times when x(t) = 0. 
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qualitative idea of how the solutions should behave, although this interpretation is 
not entirely accurate, as discussed in the next section. 

Because of the relatively simple form of these equations the possible behaviour 
of the solutions is restricted. First, note that at any stationary point (x*, y*) we 
must have y* = 0 (zero velocity), and x* must be a turning point of V(x) (since 
we need V’(x*) = 0). The linearisation near such a stationary point is simply 


ee 
dt \-V"(*) O}” 


so that the eigenvalues of the matrix are the solutions of 


| 7 = +V"(x*) =0, 


1 
—V"(x*) -A | 
ie. A = +./—V(x*), giving either a pair of real eigenvalues of opposite sign if 
V"(x*) <0 ora pair of purely imaginary eigenvalues if V’"(x*) > 0. 

Thus any stationary points corresponding to maxima of V (where V’”(x*) < 0) 
will be saddle points, while those corresponding to minima of V (where V”’(x*) 
> 0) will be centres for the linearised equation. 

Since trajectories move on the curves 5 y? + V(x) = E, these ‘linearised cen- 
tres’ will in fact be centres for the full nonlinear equations, as we can see by consid- 
ering the curves of constant E (you may find it useful to refer here to Appendix C). 
Maxima and minima of E occur when dE/dx = V’(x) = Oand dE/dy = y = 0, 
precisely at the stationary points. The Hessian matrix of second derivatives of E is 


simply 
V(x) O 
0 Ley 


Since its eigenvalues are V(x) and 1, minima of V are also minima of E, and 
maxima of V are saddle points of E. Thus when V’’(x*) > 0 the point (x*, 0) isa 
local minimum of E, and so nearby curves of constant EF are closed. 


Example 34.1 A particle of mass I moves on a line under the influence of a po- 
tential V(x) =x — 5x3, as illustrated in Figure 34.1. Sketch the phase portrait, 
and describe the motion. 


The energy 


E= hy? +x—- 5x 


will be constant along any trajectory, and the curves of constant FE are shown in 
Figure 34.2. 
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Fig. 34.1. The potential V(x) = x — 5x3 plotted against x. 
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Fig. 34.2. Curves of constant E(x, y) = sy +x—- qx. 


The equation of motion is ¥ = —V‘(x), 
K=-1l+x’, 


and setting y = x yields the coupled equations 


x= 


For a stationary point we need y = 0 and —1 + x” = 0, so there are two station- 
ary points, (—1, 0) and (1, 0). Looking at the potential, you might expect oscilla- 
tions about x = —1 (these would be closed orbits around the point (—1, 0)), and 
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Fig. 34.3. The phase portrait for equation (34.4). Within the bold curve the parti- 
cle oscillates about x = 1, while outside it rolls away to x = +00. 


instability near x = 1 (close to the point (1, 0)). We check this by calculating 


1 
Df(x, y) = Gs 0) . 


The eigenvalues of this matrix are +./2x; the complex conjugate pair +/2i when 
x = —1 and two distinct real values of opposite sign +./2 when x = 1. 

So (—1, 0) is a centre and (1, 0) is a saddle, where the eigenvectors are (1, V2) 
in the unstable direction and (1, —/2) in the stable direction. ‘Joining the dots’ 
we get the phase portrait shown in Figure 34.3. Note that when FE < Egyit = 2/3 
the particle can move to and fro in the ‘well’ of the potential. When EF = Egit 
there is one orbit that starts from the top of the rise, rolls to the left, and then rolls 
back and comes to rest exactly where it started; this trajectory, shown as a bolder 
line in the figure, forms both the stable and unstable manifold of the point (1, 0). 
When E > Eg the particle always rolls off to x = +00. 


34.2 *A bead on a wire 


Earlier in the chapter we said that it was possible to get an accurate qualitative idea 
of the dynamics of the equation 


x = —V'(x) (34.5) 


by imagining a bead sliding on a wire whose height is given by h = V(x). How- 
ever, (34.5) is not the right model for this situation, as we now see by considering 
the ‘bead on a wire’ problem in more detail. 
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Fig. 34.4. A bead on a wire whose shape is given by h = V(x). 


We assume that the bead slides (without friction) on a wire bent into the shape 
given by h = V(x) (where h is the height above some reference level), see Fig- 
ure 34.4. In particular we want to relate this to the behaviour of a particle moving 
on a line in a potential forcefield V (x). Again for convenience we take the mass of 
the bead to be 1. 

Although V (x) is the potential energy (taking units in which g = 1), the kinetic 
energy of the bead also has to include the vertical component of its motion. Since 
the position of the bead when its horizontal coordinate is x is 


x = (x, V(x), 
its velocity is 
k=, VG)%). 
So its kinetic energy, six’, is 
pix? = 54°01 + (V'@))1, 
and the total energy is 
EB = 5x7(1+ (V'(x))*1 4+ VQ). 
Differentiating this gives 
0 = 81+ Vx) 4.8°V'OX)V" (a) a + VDE: 
dividing by x and rearranging we have 


x01 + V/(x)7] = —V/(x)V" (x) x? — V(x), 
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or 


tee | 


a —V'(x) i+ Vay 


Setting y = x we end up with the coupled system 


tay (34.6) 


” 2 
y= —V'(x) - + V"(x)y | 


1+V'(x)? eae 


We will now see that these complicated looking equations have the same sta- 
tionary points as the system 


kx=y y=-V(x) 


and that these stationary points have the same stability properties for both sets of 
equations. 

First, note that at any stationary point (x*, y*) we must still have y* = 0, using 
equation (34.6). For y = 0 equation (34.7) then requires 


: 1 

Be fi + var 
(the expression in the square brackets has simplified since y* = 0). Since the de- 
nominator 1 + V’(x) is always strictly positive, we must have V’(x*) = 0 for a 
stationary point. So, as before, the stationary points (x*, 0) occur when x* is one 
of the turning points of V (x). 

With a little more algebra we can also show that the stationary points have the 
same stability type as their counterparts in the simpler problem, which depended 
only on the sign V’’(x*). Here Df is given by the daunting expression 


0 1 
Df(x) = & LV"? ) yr [VV y?—2v/ VLE?) -awy : 
It+v?2 (1+V/)2 l+v2 


where V’, V”, and V’” are understood to depend on x. However, near any station- 
ary point (x*, y*) the linearisation is simply 


fos 8) 
ae —~V"(x*) 0 > 


just as before, since y* = 0 and V’(x*) = 0 at any stationary point. Just as for 
the simpler model, if V’’(x*) > 0 then the stationary point is a centre, and if 
V"(x*) < 0 then it is a saddle. 
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Fig. 34.5. The phase portrait for a bead moving on a wire bent in the shape h = x — 5x. 


3 


For a bead sliding on a wire bent into the shape h(x) = x — 5x as in Exam- 


ple 34.1 the equations are x = y and 
f= 2x97 
pe=0 =34)| = |. 
‘ ( | | 
The phase portrait is shown in Figure 34.5. Although the individual trajectories 
have changed shape, the qualitative behaviour is the same as for the particle mov- 


ing in the potential V(x) (see Figure 34.3); if the energy is below a certain level 
the bead oscillates, while if the energy is too high the bead escapes to x = +00. 


34.3 Dissipative systems 


In Chapter 13 we looked at models of oscillating systems, both with and without 
damping. There we modelled the effect of damping by including in the equation 
for x an additional term —kx (with k > 0), representing a force acting to oppose 
the motion. 

If we include a similar factor in equation (34.2) then it becomes 


mx = —kx — V'(x). (34.8) 


The new damping term has the effect of dissipating the energy in the system. If we 
now calculate dE /dt we have 


dB. dh g 
= mxx + V'(x)x = —kx? — V’(x)x + V/ (x) x 
= —kx? <0. 
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Since —kx? is negative unless the particle is at rest, the energy always decreases 
while the particle is moving. 
Writing (34.8) as a coupled system we have 


x=y 
my = —ky — V'(x). 


We will first consider such systems in general, relating their behaviour to that of 
the undamped system with k = 0. Again we will take m = 1 for simplicity. 

The stationary points of the damped system are the same as the stationary points 
of the system without damping, since at any stationary point we must have y* = 0 
to make x zero, and then the equation for y requires that V’(x*) = 0. Although the 
stationary points do not move with the addition of damping, their stability prop- 
erties change. The eigenvalues of the matrix that occurs in the linearised equation 
near a stationary point (x*, 0), 


dé ( 0 1 


are the solutions of the characteristic equation 


| ae Mage +kA+ V(x") = 0. 


—V"(x*)  -k- ; 


Using the quadratic formula, 


rv 


oaks Vk? — 4V"(x*) 
= 5 ; 


Without damping (k = 0) any maximum of V (where V’”’(x*) < 0) was a sad- 
dle point. Even with damping this is still the case, since the expression within 
the square root is always positive and greater than k, giving one positive and one 
negative eigenvalue. However, the stability type of minima (where V”(x*) > 0) 
changes. Where before we had a purely complex pair of eigenvalues, and so a 
centre, now the real part of both eigenvalues will be negative, and depending on 
whether k* > 4V""(x*) or k? < 4V"(x*) the stationary point will be a stable node 
or a stable spiral. 

We now return to the system of Example 34.1 and investigate the effect of such 
a damping term. 


Example 34.2 Draw the phase portrait for the system 
x=y 


y=-y—14+x?. 
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As we noticed in general above, the stationary points are the same as they were 
for the undamped system, namely (£1, 0). However, their stability properties have 
changed. If we now calculate Df we have 


1 
Df(x, y) = Cs Ae 


About (1, 0) the linearisation is 


dé (0 1 

dt ie =) 6: 
and the eigenvalues of the matrix of the right-hand side are given by the solutions 
of 


—iX 1 
2 -l-A 


[=2?+44-2=@-DA+2)=0, 


ie. A = 1 or A = —2. The eigenvalues are real and of opposite sign, and this sta- 
tionary point is still a saddle. The eigenvector corresponding to A; = 1 is given 
by 


and therefore is v; = (1, 1); while the eigenvector corresponding to 42 = —2 can 
be found from 


and therefore is vz = (1, —2). 
Near the stationary point (—1, 0) the linearisation is 


dg 0 1 
dr (°, je 


and the eigenvalues of the matrix are the solutions of 


—x 1 5) 
ze a3 +iA4+2, 
which are 
-lt/J1-8 
A= = —5(1+ V7i); 


2 


this stationary point is a stable spiral. 
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Fig. 34.6. The phase portrait for the dissipative system from Example 34.2. 
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Fig. 34.7. From left to right, the potentials from parts (i), (ii) and (iii) of Exer- 
cise 34.1 plotted against x. 


The phase portrait is shown in Figure 34.6. It is clear that now, with damping, 
there are no periodic orbits, and the particle will either come to rest at x = —1 or 
escape to x = +00. 


Exercises 


34.1 For the following choices of potential functions V(x) write down the total energy 
for a particle of unit mass, and assuming that this is conserved write down a coupled 
system for x and y = x. Draw the phase portrait and interpret the dynamics. 

(i) V(x) = 40x?; 
(ii) Vx) = 5x4 — x?; and 
(iii) V(x) = gx° — 3x4 + 2x7. 
(Pictures of these potentials are shown in Figure 34.7.) 

34.2 For the functions V(x) in parts (i) and (ii) of Exercise 34.1 write down the kinetic 
energy of a particle of unit mass moving on a wire whose height as a function of x is 
V(x). Taking g = 1 write down the total energy, and hence derive the second order 
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equation satisfied by x. Write down a coupled system for x and y = x, and draw the 
phase portrait. 

Write down the equation of motion for a particle of unit mass moving in each of the 
potentials in Exercise 34.1, when there is an additional damping force —x (in part (i) 
take m = 1). Draw the phase portrait for each case. 

(T) A particle of unit mass moves on a wire whose height as a function of x is V(x), 
and is subject to an additional damping force —kx. Write down the equation of mo- 
tion, and show that the behaviour of this system is qualitatively the same as that of 


% = —V' (x) — ki. 


(C) Investigate the dynamics of the equations in exercise 34.1 both with and without 
damping, using the M-file newtonplane.m. The program asks for the level of 
damping k, and then a succession of initial conditions. The equation is specified in 
the file newt onde .m, currently set up for the example V(x) = x — 5x in the main 
text. By changing this file you should be able to consider all the examples in Exercise 


34.1, and also the equivalent problems for a ball rolling on a wire. 


35 


The ‘real’ pendulum 


We end our treatment of phase portraits by returning to the example of the simple 
pendulum. 


35.1 The undamped pendulum 


In Chapter 13 we derived the exact equation for the motion of an ideal pendulum, 


da 
qo —w* sind, (35.1) 


(where w? = g/L) but we then approximated this by 6 = —w76 in order to apply 
the methods we had just learned for linear equations. 

Here we will use phase plane methods to understand the nonlinear equa- 
tion (35.1). For simplicity we will choose w = 1 and consider the equation 


d0 


di = —sind. 


In order to look at this as a set of coupled first order equations we set x = 6 and 
y = 6 and then 


ace (35.2) 
y = -—sinx. 

Note that the direction field (shown in Figure 35.1) repeats itself every 27 in 
the horizontal direction. This should not be a surprise, since the x coordinate rep- 
resents the angle of the pendulum to the vertical (@ in our original equation), and 
the value 6 = x + 27 corresponds to the same position of the pendulum as 6 = x. 
So we should consider (x, y) and (x + 27, y) as representing the same physical 
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Fig. 35.1. The direction field for the pendulum equation. 


state of the system. The natural way to present our phase diagrams, then, is to 
restrict to a range of x corresponding to one particular choice for the angle 0, 
—7 <x <1, say. We will show a slightly longer range of x, since this may be 
helpful in understanding the phase portraits, but you should remember that the fig- 
ures ‘wrap around’ from one side to the other. We return to this at the end of the 


chapter. 
The first step in the phase plane analysis is always to find all the stationary 
points. If (x*, y*) is a stationary point then we need 


a =O and —sinx* = 0. 


The first equation tells us that y* must be zero at any stationary point, while the 


second implies that x* = kam for some (positive or negative) integer k. 
Thus any point x* = (kr, 0) with k an integer is a stationary point. In fact there 


are just two distinct stationary points here: (0, 0), which corresponds to the pendu- 
lum hanging vertically downward; and (a, 0) which corresponds to the pendulum 


being precariously balanced vertically upward. 
To check the stability of these stationary points we have to consider the eigen- 


values of the matrix of partial derivatives 


Di = @e Xx ») 


near the stationary points. 
Near the origin (and all stationary points (2k77, 0)) the linearised equation is 


de oO 
aed 3g (35.3) 


the eigenvalues of this matrix are +i, so the origin is a (linearised) centre, round 


which the orbits travel clockwise (when € = 0 and 7 > 0, é =n> 0). 
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Fig. 35.2. The local phase portraits drawn near three stationary points. 


Near (7, 0) (and all the stationary points ((2k + 1), 0)) the linearisation is 


dé (0 1\, 
== (i 5) & (35.4) 


the eigenvalues are +1 with corresponding eigenvectors (1, +1). 

We can now put this information on a phase diagram, at present just drawing 
some trajectories close to the stationary points. Figure 35.2 shows the local phase 
portraits near the three stationary points closest to the origin, (7, 0) and (0, 0) 
itself. 

To complete the picture it is helpful to use the fact that the energy is constant, 


sy" —cosx = E = constant. 


On each solution curve 


y? = 2(E + cosx) => y=t/2(E + cosx) 


and the type of curves traced out by trajectories depend on the value of E. It is 
clear that if E > 1 then we can solve for y for every value x; there will be one 
curve that has y > 0 and one with y < 0. If E < 1 then we can only solve for a 
range of x values, and this will lead to a closed curve passing through y = 0. The 
critical value is E = 1. Curves of constant E are shown in Figure 35.3, and the 
phase portrait is shown in Figure 35.4. 


Note that 


(i) for E < 1 there are a collection of closed curves circling the origin; 
(ii) the curves corresponding to E = 1, shown as bold lines in Figure 35.4, form the stable 
and unstable manifolds of the saddle points at (+77, 0), and connect these two points; 
(iii) for E > 1 the value of x is either always increasing (y > 0) or always decreasing 
(y < 0). 


These translate to the following behaviour of the original pendulum: 
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Fig. 35.4. The phase portrait for the simple pendulum. The bold lines indicate the 
“separatrix’, dividing a region in which the pendulum oscillates about the down- 
ward vertical, and a region in which the pendulum whirls around through 360°. 


(i) If the pendulum has less than a critical amount of energy (EF = 1) then it performs 
oscillations back and forth about 6 = 0, i.e. its rest position pointing downwards. 

(ii) If the pendulum has the critical amount of energy then it can make exactly one rev- 
olution from the upright position (9 = —7r) back to the upright position (6 = Tr) (or 
vice versa) and no further. 

(iii) If the pendulum has more energy (EF > 1) then it can whirl around forever. 
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Note that the stable manifold of the saddle points (the bold lines in Figure 35.4) 
form the “separatrix’ that divides the phase plane into a region of oscillation and a 
region of ‘whirling’. 


35.2 The damped pendulum 


It is, of course, possible to analyse the damped pendulum in a similar way. With 
the addition of damping the equations become 


x=y 


y = —sinx —ky, oD) 


where k > 0. (For the behaviour of the system with a quadratic damping term 
—ky|y| in the equation for y see Exercise 35.2.) While the stationary points are 
unchanged, their stability changes. 

The saddle points at (+77, 0) remain saddles, since the linearisation near these 
points is now 


and the eigenvalues of the matrix are the solutions of 
+kr-1=0, 
which gives 


_ kav +4 
=, 


Xr 


so that both eigenvalues are negative. The eigenvector in the stable direction (cor- 
responding to the negative eigenvalue) is 


(erra_t): 


and the eigenvector in the unstable direction (corresponding to the positive eigen- 
value) is 


(_,_veza)) 
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Fig. 35.5. The phase portrait for the damped pendulum. All trajectories tend to 
(0, 0). Remember that x = 7 and x = —T correspond to the same angle, there- 
fore as trajectories leave the diagram on the left they reappear on the right (and 
vice versa). 


The origin becomes a stable node or spiral; the linearisation is now 


and so the eigenvalues of the matrix are the solutions of 
2 +kA+1=0, 


which are 


—k+JVik2—4 
5 , 


A= 


The phase portrait is shown in Figure 35.5 for the choice k = 1 when 


—-1+iV/3 
2 


A= 


and the origin is a stable spiral. 

The behaviour predicted by this phase diagram accords with our physical in- 
tuition; from any initial condition (apart from being precariously balanced ver- 
tically upwards) the pendulum will eventually come to rest hanging vertically 
downward. 
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Fig. 35.6. The ‘phase cylinder’ for the pendulum. The ‘*’ represents the pendu- 
lum pointing vertically downwards, and the ‘x’ is the cylinder vertically upwards. 
The angular coordinate is 0 (or x modulo 27, the angle of the pendulum to the 
vertical), and the vertical coordinate is 9 (or y in our coupled equation). 


35.3 Alternative phase space 


Since the point (x + 27, y) corresponds to the same state of the system as the 
point (x, y), it would really be more sensible to draw a phase diagram that reflected 
this. Figure 35.6 shows the phase diagram drawn on the ‘phase cylinder’, where 
the coordinate on the circle represents the angle 6, and the vertical coordinate 
the angular velocity 6. The M-file cylinder .m draws animated trajectories of the 
pendulum equation moving on this surface. 

It is possible to cast the whole theory of dynamical systems in a general setting 
that enables the use of such phase spaces (which are manifolds) in a very natural 
way. 


Exercises 


35.1 Draw the phase portrait for the damped pendulum equations in (35.5) when k = 2 
and when k = 3. 
35.2 Consider the equation for a pendulum with a quadratic damping term 
x=y 
y = —sinx — kyly|. 
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Show that if FE = sy? — cos x then 


dE 
— =—ky*ly]. E35.1 
7 y|y| ( ) 


Show that the point (0, 0) is a centre for the linearised equation, but using (E35.1) 
deduce that for the nonlinear equation it behaves like a stable spiral, and hence draw 
the phase diagram. (Remember that ‘linearised centres’ do not have to be centres for 
the nonlinear equation.) 
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*Periodic orbits 


We have already seen that showing the existence of periodic solutions is much 
more difficult than showing that there are stationary points, and that the ‘joining 
up’ of trajectories that is required for a periodic orbit is a sensitive thing. In this 
brief chapter we look at two results, one that excludes the possibility of there being 
any periodic orbits, and one guaranteeing that there is at least one. 


36.1 Dulac’s criterion 


Dulac’s criterion is a way of showing that there cannot be any periodic orbits within 
some region of the phase space. Suppose that we are considering trajectories of the 
differential equation 


x= f(x,y) 
y = g(x,y). 


Then given a region 2 C R?, if we can find a smooth function h(x, y) such that 
0 a 
a hf) + the) #9 
Ox dy 


for all x, y € 92 then there are no periodic orbits contained wholly within 2. The 
proof is straightforward, but relies on the divergence theorem.! 


! Tn R? the divergence theorem says that if £2 is a region with smooth boundary Ih : R? > Randf: R? > R2 
are continuously differentiable functions, then 


i V - (fh) x= [ h(n-f) ds, (36.1) 
2 rT 
where n is the unit outward normal to I, see Figure 36.1. 

Now, if we can find a function / such that V - (ff) 4 0 in &2, then this means that the sign of V - (f/) is 
constant throughout $2. Then in particular we must have 


i; V- (fh) d2x 40. 
2 


360 


36.2 The Poincaré—Bendixson Theorem 361 


Fig. 36.1. A region {2, its boundary I and one of the outward normals n. 


For example, we can easily show that for many choices of parameters there are 

no periodic orbits in the ecological models 

x =x(A-—ax+by) 

y = y(B -—cy+dx). 
We will suppose that a,c > 0, but will say nothing about A, B, b and d. If we 
choose h(x, y) = (xy)7! then 

ae x) _ ee —a(x/y) + 1) 
hg (x, y) B/x —c(y/x) +d 


and 


dA ax 0;/B oc a ec 
—}—-—+b))/4+— eeeey, =—-——-., 
ax | y y dy |x x y x 


Since a,c > 0 there can be no periodic orbits in the region x, y > 0, since this 
expression is always negative there. 


36.2 The Poincaré—Bendixson Theorem 


More positively, the Poincaré—Bendixson Theorem guarantees the existence of 
a periodic orbit under certain conditions, and limits the complexity of two- 
dimensional systems. 


Theorem 36.1 Let D be a bounded region that orbits enter and never leave and 
that contains no stationary points. Then any orbit entering D is attracted to a 


However, if Mis a periodic orbit then it is everywhere tangent to the velocity field f, cf. Figure 32.3. This means 
that n, which is normal to the periodic orbit, must be normal to the velocity field, and so f-n = 0. Thus the 
right-hand side of (36.1) must be zero, a contradiction. 
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Fig. 36.2. If orbits enter and never leave the shaded region (D in the theorem), 
and this region contains no stationary points, then there is at least one periodic 
orbit contained entirely within D. 


periodic orbit; in particular there is at least one periodic orbit lying within D. 
(This is illustrated in Figure 36.2.) 


The following corollary (which follows with a little additional work) says that 
the most complicated behaviour in two-dimensional systems is a periodic orbit. 


Corollary 36.2 If all orbits are bounded then each trajectory converges either to 
a Stationary point or to a periodic orbit. 


Exercises 
36.1 Use Dulac’s criterion to show that periodic orbits in the equations 
x=y 
y = —ky — V(x) 


are only possible if k = 0. 
36.2 Consider the coupled system” 


& = y+ 4x(1 — 2r’) 
—x+5y(-7r%), 


y 


2 


where r* = x* + y?. First, show that the system has only one stationary point which 


2 This example is taken from P. A. Glendinning Stability, instability, and chaos (Cambridge University Press, 
1994). 
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lies at the origin. Now, by finding the equation satisfied by r, show that trajectories 
enter (and do not leave) the region D, where 


D={Gi)t 5 =r SI}. 


Use the Poincaré—Bendixson Theorem to deduce that the system has a periodic orbit 
lying within D. 
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We saw at the very end of the last chapter that the most complicated dynamics that 
can occur in a system of two coupled equations (a two-dimensional system) is a 
periodic orbit. 

In this chapter we discuss, briefly, a three-dimensional system of equations, 
the Lorenz equations, which demonstrates that much more complexity is possi- 
ble once we have three variables. Without the help of numerical solutions and 
computer-aided visualisation our understanding of these equations would still be 
poor. This chapter contains many computer-generated images, and some samples 
of MATLAB output performing some tedious eigenvalue calculations. Most of this 
was generated by the M-file Lorenz37.m, and it might be helpful to have this 
program running as you read. 

Lorenz introduced his relatively simple system as a model in which to study 
various theoretical problems involved in meteorology, and in particular in weather 
prediction.! It is based on a model of convection (when a layer of fluid is heated 
from below), greatly simplified. The model is 


x =o(-x+y) 
y=rx-—y—xXxz (37.1) 
Z=—bz+xy. 


There are three parameters in the problem: b, r, and o. Standard values have es- 
tablished themselves over the years: b = 8/3, r = 28, ando = 10. 


2 


6 parameters 


>> sigma=10; r=28; b=8/3; 


Ig, Lorenz, Deterministic nonperiodic flow, J. Atmos. Sci. 20 (1963), 448-464. 
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The basic steps towards understanding the problem should be familiar from our 
analysis of two-dimensional systems; we find the stationary points and determine 
their stability. 

For these parameter values there are three stationary points; one at the origin (it 
is easy to find this one) and two more at 


(tVb(r — 1), +Vb(r — 1), rr — 1). 
% non-zero fixed points 
>> x=sqrt(b*(r-1)); y=x; z=r-1; [x y z] 
ans = 8.4853 8.4853 27.0000 
In order to determine the stability type of the stationary points we have to look 


at the linearised equation near each one. The matrix of partial derivatives is now 
3 x 3, 


Df(x, y,z) = ar —1 —Xx 


To determine the stability we look (as we did for systems of two equations) at the 
eigenvalues of this matrix. At the origin the matrix is 


and its eigenvalues are given by the solutions of 


-o-iA o 0 
r —-1-A 0 = (—b — A)[(—o0 — A)(—1 — A) - or] 


=-(A+b)A?+(1+o0)A—o(r —1)] = 0. 


SoA = —b or 


_ -d+oe)+VJV0 +0)? +40(r — 1) 
am 7 ; 


Xr 
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For the particular parameter values above this gives the three eigenvalues 


—22.8277, 11.8277 and —8/3. 


eigenvalues and eigenvectors at the origin 


oe =o 


L gives values, V gives vectors 


[V L] = eig([-sigma sigma 0; r -1 0; 0 0 -b)) 


V = -0.6148 -0.4165 0 
0.7887 -0.9091 0 

0 0 1.0000 

L = -22.8277 0 0 
0 11.8277 0 

0 0 -2.6667 


There are two stable directions at the origin, and one unstable one, so the origin 
is some kind of three-dimensional analogue of a saddle point, as can be seen if 
you look at the direction field near the origin (shown in Figure 37.1) in the right 
way. 


Fig. 37.1. The direction field near the origin. 
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Fig. 37.2. The direction field near one of the non-zero stationary points. 


The linearisations near the two non-zero stationary points is the same, and the 
eigenvalues are most easily found numerically: 


fe) 


% eigenvalues and eigenvectors at the non-zero points 


>> [V L]=eig([-sigma sigma 0; r-z -1 -x; y x -b]) 


V= 0.8557 -0.2779 - 0.28391 -0.2779 + 0.28391 
-0.3298 0.0089 - 0.56991 0.0089 + 0.56991 
-0.3988 -0.7186 + 0.02931 -0.7186 - 0.02931 

L = -13.8546 0 0 

0 0.0940 +10.1945i 0 
0 0 0.0940 -10.19451 


Near these points there is one stable direction, and a two-dimensional unstable 
manifold of ‘spiral type’. It is quite hard to see all this structure in the direction 
field (shown in Figure 37.2), but you should be able to make out the ‘rotating’ 
behaviour. 

The trajectories cannot just settle down to a stationary point, since all of them 
have unstable directions. There is also no obvious way to ‘join the dots’ of the 
local phase portraits, as should be clear from Figure 37.3, which puts the direction 
fields near the three stationary points on one figure. 

However, the trajectories do not escape to infinity; if we consider 


ViayQ=artyt+e-o-ry 
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Fig. 37.3. The direction fields near the three stationary points. 


(which is the square of the distance of the point (x, y, z) from (0,0, 0 +1r)) then 
V(t) changes according to 


“ = 24%4+ 2yy +2 =e —1)z 
= 20x(—x + y) + 2y(rx — y — xz) + 2(2 —o —r)[—bz4+ xy] 
ans 2y? ey ae 2b(r +a)z 
~2ox? —2y? —b(zg —r—0)* — bz? + d(r +0) 
—aV +b(r +0)’, 


A 


where @ = min(2o0, 2, b) (which is 2 for our choice of parameters). This shows 
that V decreases when V > b(r +. o)*/a. It follows that eventually all trajectories 
will have V < 2b(r +0)? /a, and lie at a bounded distance from (0, 0,7 +0). 

So what happens to the trajectories? One thing we might try, looking at the 
solutions as functions of time, is unhelpful (see Figure 37.4). 

Even though a picture of the trajectory traced out by a typical solution 
gives a very complicated looking picture, this approach is much more fruitful. 
Figures 37.5 and 37.6 give two different views of the same trajectory after a small 
period of time. 

Essentially, these figures show the famous ‘Lorenz attractor’. Unless you start 
on the stable manifold of one of the stationary points, if you trace out the trajectory 
in the three-dimensional phase space you will end up with pictures that look some- 
thing like these. You can do this for various choices of initial conditions using the 
M-file lorenzdraw.m. 
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Fig. 37.4. Graphs of x, y, z, and V(x, y, z)!/* against t on one solution. 


Lorenz attractor 


Fig. 37.5. A trajectory of the Lorenz equations. 


However, suppose that you start two trajectories very close together. The re- 
sult of doing this with two initial conditions that are extremely close (the initial 
x coordinates differ by 0.0001) is shown in Figure 37.7 (the x coordinate only). 
Although the solutions look the same for a time, after a while they are completely 


370 37 *The Lorenz equations 


30 20 x 


y 


Fig. 37.6. Another view of the trajectory in Figure 37.5. 


-20 L L L Ll Ll Ll Ll L 
0 5 10 15 20 25 30 35 40 45 50 


Fig. 37.7. The equations exhibit sensitive dependence on initial conditions. 


different. This is another example of the ‘sensitive dependence on initial condi- 
tions’ that we saw for the logistic map in Chapter 24. 

This is illustrated more dramatically in the four pictures that make up Fig- 
ure 37.8. In each picture there are in fact 125 values of x(t). They start so close 
together that even at time t = 18, shown in the first picture, they still appear to 
be the same. However, in the following three pictures you can see them spreading 
apart, until at time ¢ = 34 they are scattered all over the attractor. (The MATLAB 
program solvem.m performs all the integrations necessary to generate all the so- 
lutions, which you can then watch as they move using flies.m.) 

A very small initial change makes a huge difference to the way the solutions 
eventually behave. This is the origin of the notion of the ‘butterfly effect’; if a 
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40 40 
20 20 
0 0 N 
20 20 20 20 
0 0 
-20 ‘ -20 20 


Fig. 37.8. 125 solutions of the Lorenz equations, at times tf = 18 (top left); t = 21 
(top right); t = 24 (bottom left); and t = 34 (bottom right). The axes are orien- 


tated similarly to those in Figure 37.6. 


Fig. 37.9. Graph of a solution z(t) and its successive maxima. 


butterfly flaps its wings on one side of the world it could cause a storm on the other. 
In other words, very small changes to the initial condition can produce widely 
different effects after some time. In terms of weather prediction we know this 
very well; forecasts are reasonably accurate for the next day, but not for one week 
later. Of course, the ‘butterfly effect’ is a very colourful description of the idea. 
Atmospheric models will take into account the fact that variations on very small 
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Fig. 37.10. The result of plotting successive maxima of z(t) against each other. 


scales (like the flapping of the butterfly) cannot be expected to have such huge 
effects, because the viscosity of the air should damp out such disturbances. 

As one way of trying to understand how these complicated solutions behave, in 
his original paper Lorenz suggested concentrating on the successive local maxima 
of z(t). If z, is the nth local maximum, see Figure 37.9, Lorenz suggested plotting 
the next local maximum z,4+1 against z,. A priori there is no reason why this should 
not just produce points all over the plot. 

However, Figure 37.10 shows the result. Remarkably, it appears that z,+1 is 
given by a function of z,, say f, so that 7,41 = f(Z,). Thus it should be possible 
to investigate the solutions of the Lorenz equations by analysing the dynamics of 
this iterated map, which is surprisingly similar to the tent map that we considered 
at the end of Chapter 24. 


38 
What next? 


In this book we have covered all of the basic methods for finding the explicit so- 
lutions of simple first and second order differential equations, along with some 
qualitative methods for coupled nonlinear equations. We have also discussed dif- 
ference equations, and seen how complicated the dynamics of even very simple 
iterated nonlinear maps can become. 

There are two ways in which to proceed further with the material developed 
here. One arises from turning first to the study of partial differential equations, 
while the other essentially continues from where we have left off. 


38.1 Partial differential equations and boundary value problems 


Partial differential equations model systems that have spatial as well as temporal 
structure, for example the temperature throughout an object, the vibrations of a 
string or a drum, or the velocity of a fluid. 

In general linear partial differential equations are easier to solve. By using the 
technique known as ‘separation of variables’ it is possible to convert such a prob- 
lem into an ordinary differential equation. This was touched on briefly in Exer- 
cise 20.10, and the exercises in this chapter apply this method in more detail for 
the example of the vibrating string. 

Viewed in general this approach leads to the theory of Fourier series (the ex- 
pansion of an arbitrary function as a sum of sines and cosines, see Exercises 38.7 
and 38.8), and its extension via the Sturm—Liouville theory of boundary eigenvalue 
problems 


d dy ; 
awe (om) + q(x)y =Aw(x)y with y(a) = y(b) = 0. 
x dx 


These topics are treated at length in many differential equation textbooks, for ex- 
ample, 
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W. E. Boyce and R. C. DiPrima, Elementary differential equations and boundary value 
problems, 7th edition (John Wiley & Sons, 2001) 

C. H. Edwards and D. E. Penney, Differential equations and boundary value problems, 
2nd edition (Prentice Hall, 2000) 

R. K. Nagle, E. B. Saff and A. D. Snider, Fundamentals of differential equations and 
boundary value problems (Addison-Wesley, 2000), 


all of which also cover much of the material presented in this book. 
The theory of Fourier series is treated rigorously in 


H. A. Priestley, Introduction to Integration (Oxford University Press, 1997) 


Sturm-—Liouville theory provides one of the first concrete applications in intro- 
ductory courses on functional analysis, see for example, 


E. Kreyszig, Introductory Functional Analysis with Applications (Wiley, 1978) 

M. Renardy and R. C. Rogers, An introduction to partial differential equations, in the 
series Texts in Applied Mathematics Volume 13 (Springer Verlag, 1992) 

N. Young, Hilbert Spaces (Cambridge University Press, 1988) 


38.2 Dynamical systems and chaos 


We have investigated, at least numerically, the complicated dynamics of the Lorenz 
equations. Similarly we spent a chapter examining the behaviour of the iterated lo- 
gistic map. Both of these examples pose problems whose solution is beyond the 
techniques presented in this book, and fall into the realm of the theory of dynam- 
ical systems. The subject received a large boost in the 1980s, when it was popu- 
larised under the media-friendly ‘chaos’ banner. One particularly readable popular 
account is 


J. Gleick, Chaos: making a new science (Minerva, 1997) 
while one of the early pioneers in the subject presents it in an accessible way in 
E.N. Lorenz, The essence of chaos (University of Washington Press, 1994). 


The theory is concerned with the qualitative behaviour of the solutions of dif- 
ference and differential equations in very general situations in which we cannot 
hope to be able to find an explicit solution. Since we cannot base our understand- 
ing on explicit solutions, the subject relies on a series of very powerful general 
results (like the Hartman—Grobman Theorem we used to draw our phase portraits 
for nonlinear coupled equations) and inspired simplifications (like the substitution 
we used to turn the logistic map x,+1 = 4x,(1 — x)n) into the simpler ‘tent map’). 

Books that treat the subject from a rigorous but accessible point of view, and 
could be seen as continuing naturally from the material we have covered here are 


V. I. Arnol’d, Ordinary Differential Equations, 3rd edition (Springer Verlag, 1992) 
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R. L. Devaney, An introduction to chaotic dynamical systems, 2nd edition (Westview 
Press, 2003) 

P. A. Glendinning, Stability, instability, and chaos (Cambridge University Press, 1994) 

J. Guckenheimer and P. Holmes, Nonlinear oscillations, dynamical systems and 
bifurcations of vector fields, in the series Applied Mathematical Sciences 42 
(Springer Verlag, 1983) 

M. W. Hirsch and S. Smale, Differential equations, dynamical systems, and linear algebra 
(Academic Press, 1974) 

D. W. Jordan and P. Smith, Nonlinear ordinary differential equations, 2nd edition (Oxford 
University Press, 1999) 


Two classic advanced texts that concentrate more on the rigorous proof of fun- 
damental results are 


J. K. Hale, Ordinary Differential Equations, 2nd edition (Krieger, 1980) 
P. Hartman, Ordinary Differential Equations, 2nd edition (SIAM, 1973) 


Finally, I cannot end without recommending unreservedly the thoughtful and 
entertaining book 


D. Ruelle, Chance and Chaos (Penguin Books, 1993) 


Written by one of the foremost mathematicians working in the field, it combines 
a discussion of various topics from dynamical systems and modern theoretical 
physics with many insights into the life of the research mathematician, and life in 
general. 


Exercises 


This sequence of exercises treats the problem of the vibrating string using the method of 
separation of variables. This produces a simple boundary value problem, and serves to 
introduce the idea of Fourier series. 

The equation for the vibrations of a string stretched between x = O and x = | and at- 
tached at both endpoints is 


a7u 7) au 

or ax?” 
with u(x, t) representing the height of the string at position x at time f. Since the string is 
fixed at the endpoints, we should have u(0, t) = u(1, t) = 0 for all t. See Figure 38.1. 


(E38.1) 


38.1 Show that the principle of superposition is valid: if two functions u(x, f) 
and u2(x, ft) satisfy the equation and the boundary conditions, then u(x, tf) = 
auy,(x,t) + Bu2(x,t) also satisfies both the equation and the boundary condi- 
tions. 

38.2 Show that if we guess that a solution has the form u(x, t) = X(x)T(t) then X (x) 
and T(t) must satisfy 


1@r idx 
eT dt? X dx?" 
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Fig. 38.1. A vibrating string, fixed at the endpoints x = 0 and x = 1. 


Since the left-hand side is a function of ¢ alone, and the right-hand side is a function of x 
alone, the only way that they can be equal is if they are both constant. If we choose this 
constant to be —A then we obtain two equations, 


a err (E38.2) 
—— =—he ; 
dt? 
and the boundary value problem 
d?x 
aa = —AX with X(0) = X(1) = 0. (E38.3) 


38.3 Show that if A. < 0 then the only solution of (E38.3) is X (x) = 0 forall x € [0, 1]. 
(You can find the general solution using the methods you have learned in this 
book, and then choose the constants in order to satisfy the boundary conditions.) 

38.4 Show that if A > 0 then we only have X (x) 4 0 if we choose A = nq for some 
integer n, and then 


X(x) = AX, (x), where Xy(x) = sinntx 


and A is an arbitrary constant. 


The values A, = n277”, and the corresponding solutions X,(x), are known as the eigen- 


values and eigenfunctions for the problem 


dx 


—~ = -AX with X(0) = X(1) =0. (E38.4) 
dx2 


38.5 By requiring the solution of (E38.4) to be non-zero we have restricted the possible 
values of 4 to the eigenvalues 4, = na. Find the solution of (E38.2) when 
i = hy, and hence show that one solution of (E38.1) is 


u(x, t) = (Asinntct + Bcosntict) sinntx. (E38.5) 
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Use the principle of superposition to show that 


(oe) 
u(x,t) = So(An sinntrct + By, cosntict) sinn tmx (E38.6) 
n=1 
solves (E38.1) for any choice of coefficients A, and B,. 

38.6 Assuming that any solution of (E38.1) can be written in the form (E38.6), the 
problem becomes to determine the coefficients A, and B,,. Show that if the initial 
position and velocity of the string, u(x, 0) and du/dt(x, 0), are given then A, and 
B, must satisfy 


lee) 
u(x,0) =) B, sinnax (E38.7) 
n=1 
and 


a CO 
5 0) = Sintec An SINNTX. 


n=1 
An expansion of a function f(x) as a sum of sine functions, 


CO 


f(x) = pe sin nx (E38.8) 
n=1 
is known as a Fourier series expansion of f. It is one of the wonders of mathematics that 
any reasonably smooth function f that has f(0) = f(1) = 0 can be expanded in such a 
series. (If we also include cosine functions then we can remove the restrictions at the end- 
points.) Finding the coefficients cy, is also relatively straightforward, at least in principle. 


38.7 Check that 


av . 0 nAm 
sinntx sinmimx dx = 4 1 
0 oy} n=m. 


(The functions sinn tx and sinmrx are orthogonal on [0, 1].) 
38.8 Multiply both sides of (E38.8) by sinmx and, assuming that it is possible to 
integrate the series term-by-term, show that the coefficient c,, is given by 


1 
Ch = 2 | f(x) sinmtx dx. 
0 


Sturm-Liouville theory treats the more general eigenvalue problem 


d dy : 
ax (poo) + q(x)y = Aw(x)y with y(a) = y(b) =0 
x dx 


(in equation (E38.3) we had p(x) = w(x) = 1, g(x) = 0, a = 0, and b = 1). There are, 
again, an infinite set of eigenvalues A, for which there is a corresponding non-zero eigen- 
function y,(x). The eigenfunctions are ‘orthonormal on [a, b] with respect to the weight 
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function w(x)’, 


b 
i Yn(*) Ym (x) w(x) dx = (E38.9) 


a 


i nAz~m 


1 n=m. 
Furthermore, any function f(x) satisfying the boundary conditions can be expanded as a 


generalised Fourier series using the eigenfunctions y, (x), 


CO 


OEY. Gin: (£38.10) 


n=1 
38.9 Using the orthonormality relation in (E38.9) show that the coefficients c,, in 
(E38.10) are given by 


b 
Cm = i f(&)ym w(x) dx. 


Appendix A: Real and complex numbers 


In this appendix we discuss some basic notation and properties of real and complex 
numbers. We use ‘e’ to denote ‘is an element of’. 


Real numbers 


We use R to denote the collection of all the real numbers, so that a € R simply 
means that w is a real number. We use curved brackets to denote the open end 
of an interval, and square brackets to denote the closed end of an interval, so for 
example 


x € [a, b] represents a<x <b, 
and 

x € (a, b] represents a<x <b. 
One end of the interval is allowed to be oo, for example 


t € (—0o, t*) represents t<t*. 


Complex numbers 


A complex number is a number z of the form z = x + iy, where i = /—1. Any 
complex number can be split into its real and imaginary parts, Re[z] and Im[z], 
where 


Re[z] = x and Im[z] = y. 


The rules for addition and multiplication of complex numbers follow from ap- 
plying standard algebra and the fact that i? = —1; we have 


(a+ib)+ (c+id) =(a+c)+ib+d) 
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and 
(a + ib)(c + id) = (ac — bd) + i(ac + bd). (A.1) 


The complex conjugate of a complex number z = x + iy is written z* and is 
given by z* = x —1y. Adding z and its complex conjugate yields twice the real 
part of z, 


z+2* = (x +iy) + @ —iy) = 2x = 2Re[z], 
while their difference gives i multiplied by twice the imaginary part of z, 
z—2* = (x +iy) — (x — iy) = 2iy = 2iIm[z]. 


Also useful is the fact that the complex conjugate of a product is the product of the 
complex conjugates, (wz)* = w*z*. In order to see this, note that 


(a + ib)*(c + id)* = (a — ib)(c — id) = (ac — bd) — i(ac + bd) 
= [(ac — bd) + i(ac + bd)]}* = [(a + ib)(c + id)]*, 


using (A.1). 
One important identity involving complex numbers is Euler’s formula, 


e = cos6 +isiné. (A.2) 


The easiest way to see this is to use the power series expansion of e%, 


(see Appendix C.) If z = i then we have 


a > coy" 
nN: 


n=0 


Since i? = —1, the even powers in the expansion are real, while the odd powers 


are still imaginary, and we get 


g2ntl 


=c a eS )! HCD Gna Dt 


The two sums on the right-hand side are just those for cos 6 and sin @ respectively 
(see Appendix C again), and so e!” = cos@ +isin@, as claimed. 

This gives another, often convenient form in which we can write any com- 
plex number, known as modulus and argument form. This is when we express 
z=x+iyasz=re'®. This is illustrated in Figure A.1, which shows z plotted on 
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z=x+ily 


y=rsin0 


X=rcosé 


Fig. A.1. The complex number z = x +iy written in modulus and argument 
form, z = re!?. 


the Argand diagram. (The Argand diagram shows the ‘complex plane’, where the 
real part of z gives the x coordinate and the imaginary part of z gives the y coordi- 
nate.) 

It follows that 


r= je+y? and 6 = tan™!(y/x). 


This is entirely analogous to using plane polar coordinates instead of standard 
cartesian coordinates, cf. (29.8). 


Appendix B: Matrices, eigenvalues, and eigenvectors 


This appendix covers the simple algebra of matrices, and some properties of eigen- 
values and eigenvectors. The calculation of eigenvalues and eigenvectors is the 
main topic of Chapter 27. 


Basic matrix algebra 


For the most part, we will only need to consider the algebra of 2 x 2 matrices, 


ee) 


Addition of two matrices is component by component, so that 


& Pee ae (ease Seti 
a2, 22 bo br a2, +b2,  a22 +b22 )’ 


while multiplication is given by 


& oe & ae Gries: seen 
a2} a22) \ ba ba a21b\, +422b2,  a2,b12 + a22b22 ) 


We can write this more compactly by saying that 


2 
[AB]; = aj1b1j +. ai2b2; = D— aixdej. 
k=l 


where [A\B];; is the entry in the ith row and jth column of the matrix AB. 
One special matrix is the identity matrix, 


1 0 
(6 a): 


which has the property that [A = Al = A for any 2 x 2 matrix A. 
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A matrix A is said to be invertible, or non-singular, if there is another matrix 
A! such that 


AA =AA7! =I. 


The matrix 
a b 
A= B.1 
ee B.1) 
is invertible if and only if its determinant, det(A), given by 
det(A) = 2 : = ad — bc 


is not equal to zero, and then 
| en meena 
ad—bc\-c a 


Matrices and vectors 


Multiplication of vectors by matrices 


In general we can calculate the product AB when A is ann x m matrix and B is 
an m x k matrix (the columns of A have to match the rows of B). In particular 
this allows us to calculate Ax if A is a 2 x 2 matrix and x = (x1, x2) is a two 


component vector, 
(a . (2) = as i) 
a21 a2 x2 a2\X1 + a22x2 


We can also write this more compactly as 


[Ax]; = aj1x1 + aj2x2, (B.2) 


where v; indicates the ith component of the vector v. (Note that this means in 
particular that if [v; v2] is a matrix with columns made from the vectors v1 and 
v2 then for a2 x 2 matrix A we have 


Alvi v2] = [Av Avo], 
which is used repeatedly in Chapters 28-30.) 


Solution of simultaneous equations 


The simultaneous linear equations 


ax, +bx2=c1 
cx, + dx2 = c2 
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can be rewritten as the matrix equation 
Ax = ¢, 


where A is defined as in (B.1), x = (x1, x2) and ¢ = (c1, C2). This equation has 
a unique solution if and only if A is invertible, and then the solution is given by 
multiplying both sides by A7!, 


x=—A-'e. 


It follows that Ax = 0 can have a non-zero solution for x only if A is not invertible. 


Eigenvalues and eigenvectors 


If v 4 0 and 
Av = Av 


then A is an eigenvalue of A and v is the corresponding eigenvector. The calcu- 
lation of eigenvalues and eigenvectors for 2 x 2 matrices is treated in detail in 
Chapter 27. 


Linear independence of eigenvectors 


The eigenvectors corresponding to two distinct eigenvalues are linearly indepen- 
dent; if 


av; + Bv2 =0 (B.3) 
then we can multiply both sides by A to obtain 
A(av, + Bv2) = 0. 
Since Av; = A; vj we have 
ar1V, + BAov2 = 0. (B.4) 


While (B.3) requires that v; = —Bv2/a, the second equation (B.4) says that v; = 
(A2/41) x (—Bv2/a). Since 42 4 A, these cannot both be true unless 6 = 0, in 
which case a = 0 also, and vj and vz are therefore linearly independent. 


The special case of symmetric matrices 


a a 
Kee 11 12 : 
a21 422 


The transpose of the matrix 
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written A’, is given by 


AT = | : 
ajo ay)” 

ie. [A?];; = [A]j;. A matrix is called symmetric if A = A’, ie. if [A]ij = [A] ji- 
A general 2 x 2 symmetric matrix is of the form 


a b 

b dad)’ 
For such matrices all the eigenvalues are real, and eigenvectors corresponding to 
distinct eigenvalues are orthogonal. 


To see that the eigenvalues are real, suppose that A is an eigenvalue and v = 
(v1, U2) is the corresponding eigenvector. Then 


Av = Av, and Av* = A*y*, 


where the second equation is the complex conjugate of the first. We take the inner 
(dot) product of the first equation with v*, and of the second with v, 


v*- Av =Alv/? and v-Av* = i*|v/?. (B.5) 


Now, the expression on the left-hand side of the first equation in (B.5) is 


2 2 
v'-Av= U; [Av]; = U; AjjVj 


i=l i,j=l 
2 
— ok oe . 
= y Uj Ajidj 
i,j=l 


2 


(ihe . ae ae 
= y Vj jidj 


i,j=l 
2 
— > vj[Av"]; =v- Av*, 
j=l 


and so is the same as the expression on the left-hand side of the second equation 
in (B.5). It follows that 


Alvi? = a* lv, 


i.e. A = X* and so this eigenvalue is real. 
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To see that the eigenvectors corresponding to distinct eigenvalues are orthogo- 
nal, suppose that Av = Av and Av = Arv™ with Ay # Ao. Then 


vO . Ay® = y). ayy = ay(y . y@), 


Looking at the left-hand side of this we have 


2 
Ca oe ae So Av); ae vagy? 
i 


2 
= 3 vO [AV]; =v). Ay), 


Now, 
v2. Ay) = yaw = ay . yy, 
and since v‘)) - Av@) = v2) . Av) we therefore have 


how vy?) = Aw) uy); 


(Az — Ay)(v . v) = 0. 


Since A2 4 41 we must have vy) .yQ = O, i.e. the eigenvectors are orthogonal. 


Appendix C: Derivatives and partial derivatives 


This appendix covers the definitions and properties of ordinary and partial deriva- 
tives, Taylor expansions in one and two variables, and some properties of the crit- 
ical points (turning points) of functions. 


Functions of one variable: ordinary derivatives 


We start by considering functions f(x) of one variable, and their derivatives. 


Definition and properties of the derivative 


Let J be an interval. A function f : J] —> R is differentiable at a point x € I if the 
limit 


i f@ +h) — f(x) 
im 


C.1 
h->0 h ( ) 


exists, in which case the limit in (C.1) is the derivative of f at x, which we write 


as (df/dx)(x) or f’(x). 
This basic definition implies the standard rules of differentiation. The product 
rule is 


[fel = f’g t+ fe’ 


the quotient rule is 


387 
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and the chain rule, which allows us to differentiate functions of functions, is! 


| (209) |= r'(e09) een. 


Taylor expansions 


Taylor’s Theorem allows us to expand a function f as a power series about a point 
Xo using its derivatives. Suppose that f has n + 1 derivatives, all of which are 
continuous functions, and that we use the notation 


in d"f 
FR) = T7@). 
Then we can write 
(x — x0)* 
f(x) = f (x0) + @& ~ x0) fo) + SF") + 
(x — x0)" A¢ (= xp) 
ef Goa ei PD ACD) 
for some point y, € (xo, x). 
Provided that the remainder term 
(x= x0)" ety 
ees i C3 
m+! i (Yn) (C.3) 


tends to zero as n tends to infinity, we can write f(x) as the power series 


c.0) d” oa n 
£0) = oy 


n 
a= dx 


known as the “Taylor expansion’ or “Taylor series’ for /. 


Power series 


We can use the Taylor expansion to find representations of common functions in 
terms of power series, i.e. an expression of the form 


Ce 
SS Anx”. 
n=0 
For example, if we take f(x) = e* and xo = 0 then 


d” 
dios for all a0 eee 
dx” 


! As written here we have confused the various functions that appear with their values at the point x. More 
correctly we should write [f o g]/(x) = f’(g(x))g/(x), where fog is the composition of f and g, ice. 
(f o g)(x) = f(g(x)). But such pedantry is probably unhelpful. 
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It follows that d” f/dx” (0) = 1 for all n, and so the Taylor series for e* is 


a . sai 
a n! 
For f(x) = sin x, the derivatives are 
d2" q2ntl 
f (x) = (-1)” sin x and f = (—1)" cosx. 


dx2" dx2n+! 


It follows that d2” f/dx?"(0) = 0 and d2”*+! oan = (—1)", so that 
> ; 5 
n 
sinx = = (-1 ) ae 


A similar calculation shows that 


2n 


cosx = xe yr Ont 


For a brief discussion of the convergence of such power series, see Chapter 20. 


Turning points 


A point xo is a turning point (or critical point) for f if f’(xo) = 0. We can find out 
whether such a point is a local maximum, a local minimum or a point of inflection, 
by using the Taylor series expansion near xo. If we assume that f” (xo) 4 0 and 
we keep only the first three terms from (C.2) then we have 


eer 
FO) Foo) + =F peo), 


where there is no second term since f’(x9) = 0. (Because we know that the re- 
mainder term is K(x — x)° for some constant K (cf. (C.3)), we can be sure 
that sufficiently close to xo the last term that we have kept is larger than all the 
terms that we have neglected.) It is easy to see from here that if (x9) < 0 then 
f(x) < f (xo) close to xo, ie. that f has a maximum at xo; while if f” (x0) > 0 
then it follows that f(x) > f (xo) close to xp and so xg is a minimum of f. 

If f” (xo) = 0 and f’” (xo) 4 0 then we can take one further term of the Taylor 
expansion to find 


(x — x0)? 


0). 


f(x) = f(%o) + 


Since (x — xo)? changes sign near xo, it follows that xo is now a point of inflection. 
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Functions of two variables: partial derivatives 


We now treat similar topics for functions of two variables. 


Partial derivatives and their properties 


If f(x, y) is a function of two variables then the partial derivative of f with respect 
to x is found by treating y as a constant and differentiating with respect to x, 


af fa th, y)— f@,y), 


“(x y) = li 
a A 


similarly, 0f/dy is found by keeping x constant and differentiating with respect 
to y. 

Since df /dx is in general another function of x and y it is possible to take partial 
derivatives again, e.g. 


vf a (af rf a (af 

ax2 ax \ax J’ dydx dy \ax/)” 
One useful property of partial derivatives is that the order in which they are taken 
does not matter, so in particular we have 


af = af 
dydx dx dy’ 


(C.4) 


ie. it makes no difference if we take the partial derivative with respect to x and 
then with respect to y, or vice versa. 

Obvious generalisations of the product and quotient rule apply to partial deriva- 
tives. When f is a function of x and y, and x and y depend on the same variable 
t, we have the following version of the chain rule. In this case, when f (x(t), y(t)) 
is in fact a function of the single variable r, 


< F(x, vo) = A(x, vo) Feo + (x00 v0) 200. 


Taylor expansions 


There is a two-dimensional version of Taylor’s Theorem, which allows us to ex- 
pand f(x, y) near a point (xg, yo) as a series involving the partial derivatives of f 
at (xo, yo). Writing € = x — xp and n = y — yo, and only giving the terms up to 


Appendix C Derivatives and partial derivatives 391 


third order we have 


= Of Or eee oF rs) 
LOW FCI) Fes San weawe (« Re aay ay2 
aOF apa 2 af rt) ” 

+a (PSS POE ee TOT gag a) 


where all the partial derivatives are evaluated at the point (xo, yo). (It is possible to 
check this by first doing a Taylor expansion with respect to x keeping y fixed, and 
then Taylor expanding each of these terms with respect to y.) 

In Chapter 32 we will use the simplest non-trivial consequence of this expan- 
sion, where we keep only the terms that are linear in € and n, 


) ) 
f (xo + &, yo + n) = f (xo, yo) +8 (20, v0) + ne yo) +++: 


Critical points 


In this section we will use the shorthand notation 


af of af 
are fy = ay Fry = Fay’ 
etc. 
A point (xo, yo) is called a critical point of f if fry = fy = 0, Le. if 
0 
af = and af = 
Ox oy 


We can use the Taylor expansion of f near this point to see what kind of critical 
point we have. Keeping only terms up to second order (as we did for functions of 
one variable) we have 


f(x, y) = f (xo, yo) + Efe + afy + 5 (E7 fox + 2Enfey +10" fyy) + 


We can remove the terms linear in and n (since fy = fy = 0 at the critical point), 
and then rewrite the quadratic terms using matrix notation to give 


Fo + & yo +) © F(X. yo) + LE 0) Ge - eae (C.5) 
yx Jyy 1 


The matrix of second partial derivatives, 


_ Sex a 
H= : 
Gs fry 
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Fig. C.1. Plots of a surface whose height is f(x, y), and contour plots of f, near a 
point (xo, yo) which is, from top to bottom, a maximum, a minimum and a saddle 
point. 


is known as the Hessian matrix. Just as the type of the critical point xo for a func- 
tion of one variable is determined by the sign of f” (x0), so the type of the critical 
point of f(x, y) is determined by the properties of this matrix of second deriva- 
tives, and in particular by the sign of its eigenvalues. 

Because fyy = fyx (see (C.4)) the Hessian matrix is symmetric. It is a general 
result that a real symmetric matrix has real eigenvalues A, and Az and that if these 


Appendix C Derivatives and partial derivatives 393 


eigenvalues are distinct then the corresponding eigenvectors v; and v2 are orthog- 
onal, 


vi: v2 = 0, 


see Appendix B. Suppose that we use this notation for the eigenvalues and eigen- 
vectors of the Hessian matrix H, and choose the eigenvectors so that they have 
length 1, i.e. |v;| = 1. Now if we write the vector (€, 7) in terms of the eigenvec- 


tors of the Hessian matrix, 
tC) =av, + Bv2, 


s[ovi + Bv2]- Hlovy + Bv2] = Slav + Bv2] - [waivi + Barve] 
= y[07A1 + 6rd]. 


the second term in (C.5) is 


So we have 


f(xo + & yo +n) © f (xo, yo) + 5(@7A1 + B7A2); 


the behaviour of f near (xo, yo) does indeed depend on the eigenvalues of H. 
If both eigenvalues are positive then f(x, y) > f(x, yo) close to (x9, yo), So we 
have a minimum; if both eigenvalues are negative then f(x, y) < f (x9, yo) close 
to (xo, yo) and this point is a maximum. If the two eigenvalues have opposite signs 
then we have a saddle point: f increases in one direction (the direction of the 
eigenvector corresponding to the positive eigenvalue), and decreases in another 
(the eigenvector for the negative eigenvalue). Plots of f against x and y, and the 
contour plots of curves of constant f in each of these three cases, are shown in 
Figure C.1. Note that near a maximum or minimum of f the curves of constant f 
are closed. 
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initial condition, 26, 38 
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second order 
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Lorenz equations, 364, 374 

sensitive dependence on initial conditions, 370 
Lotka—Volterra systems, 323 


manifold 
stable, 281, 317, 326-327, 356 
unstable, 281, 317 
Matias 
airy (Airy function), 181 
and Euler’s method, 204 
besselj (Bessel function of Ist kind), 181, 190 
bessely (Bessel function of 2nd kind), 181, 
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contour (contour plot), 21 
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M-files on the web, 2 
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lportrait.m, 283 
makematrix.m, 283 
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rungekutta.m, 212 
solvem.m, 370 
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ode45 (ODE solver), 15, 18, 209 
quad (integration), 19 
quiver (direction field), 311 
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and linear transformations, 303 
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eigenvalues, see eigenvalues 
eigenvectors, see eigenvectors 
similar, 304 
symmetric, 392 
matrix algebra, 382 
maximal interval of existence, 42 
Millennium Bridge, 128 
modulus and argument form, 380 


Navier-Stokes equations, 12, 34, 42 
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law of cooling, 79, 87 

second law of motion, 12, 29 
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node 
improper, 295, 301 
stable, 277, 301 
unstable, 278, 301 
non-singular matrix, 383 
non-uniqueness of solutions of ODEs, 39, 74 
nonlinear pendulum, 352 
with quadratic damping, 358 
nullclines, 312, 340 


ODE, see differential equation 
orbit, 224 
periodic, see periodic orbit(s) 
order 
of a difference equation, 213 
of a differential equation, 13 
ordinary point, 178 
oscillation(s), 81, 117, 120 
combining two oscillations of the same 
frequency, 83, 120 
damped, 123 


partial derivatives, 390 
partial differential equations, 12, 373 
particular integral (for ODE), 132 
exponential right-hand side, 135 
polynomial right-hand side, 133 
resonace, see resonance 
rule of thumb, 139 
sine or cosine on right-hand side, 137, 143 
via variation of constants formula, 164 
particular solution (for difference equation), 219 
4” on the right-hand side, 221 
polynomial on right-hand side, 219 
pendulum 
linear approximation, 122 
nonlinear equation, 352 
damped, 356 
period, 82 
of decaying oscillations, 126 
period-doubling cascade, 237, 246 
periodic orbit(s), 226, 230, 290, 301, 360 
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phase diagram, 47 
analytic method for coupled equations, 3/4 
general comments, 309 
linear matrix equation 
complex eigenvalues, 287, 29] 
distinct real eigenvalues (coupled), 279 
distinct real eigenvalues (uncoupled), 276 
summary of possibilities, 301 
phase portrait, see phase diagram 
phase space, 56 
pitchfork bifurcation, 54 
bifurcation diagram, 56 
Poincaré—Bendixson Theorem, 36/ 
Poiseuille flow, 35 
polar coordinates, 289, 381 
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logistic equation, 5/, 66 
Malthus’ linear model, 62 
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convergence, 177 
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product rule, 387 
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quadratic damping 
air resistance, 52, 73 
nonlinear pendulum, 358 
quotient rule, 387 
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radiocarbon dating, 6 
radius of convergence, 177 
ratio test, 177 
real numbers, 379 
recurrence relation, 179 
for Airy equation, 182 
for Bessel’s equation, 188 
near a regular singular point, 186 
reduction of order, 159, 168 
for xe** solution of 2nd order ODE, 160 
for log solution of Cauchy—Euler equation, 
172 
regular singular points, 183 
log solution, 186 
resonance, 14] 
response to periodic forcing, 147 
Runge-Kutta scheme, 209, 212 


saddle (point), 278, 301, 393 
saddle node bifurcation, 57 
second order equation 
Cauchy—Euler equation, 170 
constant coefficient linear equation 
homogeneous case, 111 
inhomogeneous case, 131 
existence and uniqueness, 25/ 
general theory for linear equations, 101 
reduction of order technique, 159 
series solutions, 176 
variation of constants formula, 164 
semi-stable stationary point, 50 
sensitive dependence on initial conditions 
in the logistic map, 241 
in the Lorenz equations, 370 
separation of variables 
for ODEs, 60 
for radially symmetric wave equation, 
198 
for the vibrating string, 375 
separatrix, 281, 326, 327, 356 
series solutions, 176 
similar matrices, 304 
simple harmonic motion, 117, 121 


simultaneous equations, 383 
solution 
explicit, 14 
general, 25 
implicit, 14 
of scalar IVP (definition), 40 
of vector IVP (definition), 250 
qualitative, 15 
solutions of exercises, 2 
species 
competing, 71, 3/0, 323 
coexistence, 327 
no periodic orbits, 361 
weak competition, 323 
cooperative, 339 
predator-prey, 331, 334 
spiral 
stable, 290, 301 
unstable, 290, 301 
spring, 120, 126 
spring constant, 120 
stable 
fixed point, 225 
manifold, 281, 317, 326-327, 356 
node, 277, 301 
periodic orbit, 203 
spiral, 290, 301 
stationary point, 48 
stars, 295, 301 
stationary point, 47, 309 
attracting, 48 
semi-stable, 50 
stable, 48 
unstable, 48 
step size, 201 
string, 375 
structural stability, 50 
Sturm-Liouville theory, 373, 377 
substitution method 
for Bernoulli equations, 96 
for homogeneous equations, 94 
to solve Cauchy—Euler equation, 174 
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superposition principle, 103 
for solutions of linear matrix equation, 259 
for vibrating string, 375 

symmetric matrix, 384, 392 


Tartaglia, 151 

Taylor expansion, see also power series 
for functions of one variable, 203, 225, 388 
for functions of two variables, 315, 390 

tent map, 243, 372 

terminal velocity, 52 

timestep, 201 

trajectories, 310 
cannot cross in phase plane, 310 

transcritical bifurcation, 57 

trapezoidal Euler method, 210, 231 

turning point, 389 


uniqueness of solutions of ODEs, see also existence 
and uniqueness 252 
example of non-uniqueness, 39 
proof for scalar ODE, 44 
proof for vector ODE, 252 
unstable 
fixed point, 225 
manifold, 281, 317 
node, 278, 301 
spiral, 290, 301 
stationary point, 48 


variable(s) 
independent, 11 
of integration, 9 

variation of constants formula, 164 

vector first order equations, 249 
definition of solution, 250 
existence and uniqueness, 250 
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